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RACKGROUND OF THE INVENTION 

The present application is a continuation-in-part of PCT/US95/02251, filed February 
21, 1995; which is a continuation-in-part of U.S. Serial Number 08/316,650, filed September 
1 5 30, 1994- which is a continuation-in-part of U.S. Serial Number 08/199,780, filed February 
18, 199^tiie entire text and figures of which disclosures are specifically incorporated herein 
by refer^ without disclaimer. The United States government has certain rights in the 
present invention pursuant to Grant HL-41926 from the National Institutes of Health. 



10 A. Field of the Invention 

The present invention relates generally to the field of molecular biology. More 
0 particularly it relates to latent TGF|8 binding protein (LTBP) genes, compositions and 

m methods of use. 

B. Description of the Related Art 
1. TGF-/3 

nsO Five TGF-/3 family members, which share 66-82% sequence identity, have been 

9 identified (Kingsley, 1994). Whereas TGF-/S1 was cloned from a cDNA library derived 

from human placenta, TGF-i82 was subsequently purified from several mammalian cells and 
tissues, and TGF-/33, -/34, and -/35 were cloned^by low stringency hybridization from 
mammalian, avian and amphibian cDNA libraries, respectively. Peptide growth 
25 factors/cytokines have also been identified that share sequence homology (<40%) with the 
TGF-|Ss (collectively, the TGF-|3s plus these other cytokines make up the TGF-^ 
superfamily). A unifying feature of the biology of these other cytokines {i.e. , the MuUerian 
mhibiting substance, bone morphogenetic proteins, growth and differentiation factors, 
activin/inhibin, Drosophila decapentaplegic complex, and amphibian Vgl protein) is the 
30 ability to regulate developmental processes. In every case where information is available, 
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superfamily members are synthesized as larger precursors that are processed at 
endoproteolytic cleavage motifs, and they terminate with the sequence C-X-C-X. The three 
dimensional crystal structure of the TGF-i82 homodimer was recentiy reported (McDonald 
and Hendrickson, 1993). This work has led to the interesting and novel suggestion that 
TGF-/3 is related to certain peptide growth factors (e.g., NGF, PDGF, v-SIS) in a way that 
could not have been predicted from the deduced amino acid analysis. 

2. Latent TGF-/S Complexes 

Many cell types produce TGF-/3, and ahnost all cells bind TGF-/3 with affinities in the 
picomolar range -e.g., the type I and type II TGF-jS cell surface receptors (glycoproteins of 
53 and 75 kDa, respectively) are present in essentially all cells (Miyazono et ai, 1994). 
Thus, TGF-|8 has powerful effects on most cell types, and cytokines such as TGF-/3 are 
thought to exert broad control the tissue remodeling that occurs during development, wound 
repair, and other situations (Spom et al, 1986; Moses et ai, 1990). (For a comprehensive 
review of TGF-jS effects, see Roberts and Spom, 1990). For example, TGF-/3 was initially 
identified as a factor that stimulated the anchorage independent growth of rodent fibroblasts 
(Assoian et al., 1983; Frolik et~aL, 1983; Roberts et al, 1983). It is now known, however, 
that TGF-/3 acts as a potential growth inhibitor for most cells, i.e., epithelial, endothelial, 
and hematopoietic progenitor cells; both stimulates and inhibits cellular differentiation; 
induces extracellular matrix production by stimulating the expression of matrix 
macromolecules, stimulating the expression of matrix protease inhibitors, and decreasing the 
expression of matrix degrading proteases; inhibits the functional activities of immune cells; 
induces the chemotaxis of fibroblasts, macrophages, and smooth muscle cells; induces 
angiogenesis in vivo; inhibits endothelial migration; induces the expression of cell surface 
receptors for other cytokines {e.g., the EGF receptor); promotes the healing of mcisional 
wounds; inhibits osteoblast proliferation in vitro; and induces new bone formation in vivo. 



A molecular explanation for these complex (and, at times, conflicting) effects is not 



yet available, but hypotheses do exist. Spom et al. (1986) have suggested, for example, that 
the ability of TGF-|8 to stimulate or inhibit the proliferation of mesenchymal cells depends on 
the state of cellular differentiation and the entire set of growth factors operant in that cell 
population. As such, the "biological meaning" of TGF-|8 signal transduction depends on the 
context {i.e., availability and presentation) of other growth factors present in the local 
environment: Fischer rat 3T3 cells transfected with a myc gene and incubated with TGF-3 
and PDGF proliferate in soft agar, whereas the sane cells in the presence of TGF-jS and 
EGF fail to grow (Roberts et al, 1985). 

Whatever the mechanism, the autocrine and paracrine activities of TGF-iS clearly must 
be regulated with precision. One regulatory strategy involves the temporal and spatial 
control of TGF-|3 gene expression. A second strategy involves the production and storage of 
TGF-/3 as a latent complex that is activated only under certain physiological and patiiological 
conditions — e.g., tissue morphogenesis and remodeling, and wound healing. TGF-^1 can 
be isolated from serum and from most tissues as a latent complex (Pircher et al. , 1986; 
Miyazono et al. , 1988; Wakefield et al. , 1988). In this regard, the latent complex has been 
purified from human platelets and characterized in detail (Miyazono et al. , 1988). Following 
a 6-step protocol, the purified complex yielded protein bands of Mr 25,(X)0 and 210,000 on 
SDS-PAGE under nonreducing conditions. After reduction, the 25 kDa band was shown to 
consist of subunits of Mr 12,500. On the other hand, the 210 kDa band consisted of a Mr 
40,000 subunit and Mr 125-160,000 subunit. 

is also secreted from several producer cell lines in culture as a latent complex 
of 235 kDa (Gentry et al., 1987). TGF-j81 is initially synthesized in vitro as a 390 amino 
acid precursor that consists of a signal peptide, an amino-terminal propeptide, and the mature 
growth factor. Two precursor chains associate to form a disulfide-bonded dimer with latent 
activity. Homodimers occur most commonly, but heterodimers may also form (Ogawa et 
al. , 1992). The full length dimer is cleaved at a endoproteolytic cleavage motif, but the 
propeptide dimer {i.e. , the latency associated peptide or LAP) and the mature growth factor 



dimer typically remain non-covalently associated. The mature TGF-|S dimer is now known 
to be the 25 kDa band identified after nonreducing SDS-PAGE of the purified latent complex 
from platelets. In addition, LAP is known to be a component of the 210 kDa band identified 
after nonreducmg SDS-PAGE of the purified latent complex from platelets (i.e. , LAP has 
5 been shown to consist of two of the 40 kDa subunits). 

Togethe- LAP and the mature TGF-/3 dimer form the small latent complex. As 
demonstrated in platelets, small latent complexes may be associated with additional high 
molecular weight proteins, the best characterized of which is the latent TGF-^ binding 

10 protein or LTBP (Kanzaki et al. , 1990). (LTBP has been shown to be the 125-160 kDa 

subunit of the purified latent complex from platelets). Latent TGF-|3 complexes that contain 
LTBP are also known as large latent complexes. In contrast to platelet LTBP, the LTBP 

O produced by fibroblasts typically is a 190 kDa polypeptide. The smaller size of platelet 

5i LTBP may be due to proteolytic processing or alternative splicing (Kanzaki et al. , 1990; 

^5 Tsuji era/., 1990). 

s 

ixi 

ifl 3. LAP and Latency ^ 

ffi TGF-i8 latency results in part from the non-covalent association of the propeptide 

So dimer and the mature TGF-|3 dimer (Pircher et al. , 1984; Gentry et al. , 1988; Wakefield et 
S al. , 1989). A cDNA for the TGF-jSl precursor was expressed in Chinese Hamster Ovary 
(CHO) cells, which do not express LTBP (Gentry et al. , 1988), and ahnost all TGF-/3 
activity recovered from the medium of transfected cells was latent. Use of deletion 
constructs has demonstrated that synthesis of biologically active TGF-jSl can proceed only 
25 from the first ATG codon, implicating LAP in the proper assembly of the small latent 
complex in these cells. Taken together, these studies indicate that LAP is sufficient to 
achieve the latent state. More recent studies have shown that carbohydrate structures within 
LAP make an important contribution to the latent state. For example, treatment of the latent 
form of TGF-jSl with endoglycosidase F led to activation of TGF-jS (Miyazono and Heldin, 
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1989). (The structure of the mature TGF-jS dimer was not affected by enzyme treatment). 
In particular, sialic acid residues seemed to be important, as treatment of the purified latent 
complex with sialidase was also able to activate TGF-/3 from the latent state. 



5 4. Modulation of Latency 

Latent complexes must be dissociated to activate mature TGF-|8, ari dissociation is 
considered to be a critical step in governing TGF-/S effects (Twardzik et al. , 1990; Sato et 
al, 1993). Dissociation by chemical treatment of the latent complex purified from platelets 
10 has been investigated (Miyazono et al. , 1990). Incubation of the purified complex under 

conditions of varying pH revealed that TGF-/3 activity was unmasked at values below pH 3.5 
and above pH 12.5. Incubation of latent TGF-jS in 0.02% SDS or 8 M urea also effectively 
Cj unmasked TGF-i3 activity, but incubation in 5 M NaCl did not. Wakefield et al. (Wakefield 
m et al. , 1989) have reported that, after activation, TGF-j31 and LAP reassociate in a time- and 
'^15 concentration-dependent manner under neutral, nondenaturing conditions. These results are 
^ consistent with the idea that the mature TGF-/3 dimer is non-covalently associated with LAP. 

m _ _ _ 

L Latent TGF-/3 complexes are also dissociated by the action of certain enzymes. For 

example, latent TGF-jS is activated by plasmin, which disrupts the structure of the large 
r^O latent complex (Lyons et al. , 1988; Taipale et al. , 1995). Similar data exist for other 
S enzymes, e.g., cathepsin D, mast cell chymase, leukocyte elastase, and the glycosidases. 
Recently, osteoclast-derived cells were shown to be capable of activating latent TGF-^ in 
vitro (Oreffo et al., 1989). Osteoclast activation is of particular interest because of the 
hypothesis that TGF-|3 serves as a link between bone turnover and formation during bone 
25 remodeling (Centrella et al. , 1991). The mechanism of TGF-/3 activation by osteoclasts is 
not known at present, but it is reasonable to think diat local alteration of pH due to action of 
proton pumps in the osteoclast plasma membrane or the release of osteoclast-derived 
proteases may be involved in the activation process. Related to these observations, activated 
macrophages (as might be found at a wound site or during tissue morphogenesis) secrete 
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sialidase and other proteases (Pilatte et al, 1987), and they can lower the local pH to 4.0 
(Silver et al. , 1988), both of which could contribute to TGF-jS activation in vivo. As 
mentioned above, acidification weakens the non-covalent interaction between LAP and the 
mature TGF-|S dimer (Wakefield et al., 1989). 

5 

STTMMARY OF THE INVENTION 

The present invention concerns in an overall and general sense novel DNA segments 
and recombinant vectors encoding LTBP-2 or LTBP-3, and the creation and use of 
10 recombinant host cells through the application of DNA technology, that express LTBP-2 or 
LTBP-3 gene products. As such, the invention concerns DNA segment comprising an 
isolated gene that encodes a protein or peptide that includes an amino acid sequence 
y essentially as set forth by a contiguous sequence from SEQ ID N0:2 or SEQ ID N0:4. 
f 1 These DNA segments are represented by those that include a nucleic acid sequence 
%5 essentially as set forth by a contiguous sequence from SEQ ID N0:1 or SEQ ID NO: 3. 

■ffi 

\jl Compositions that include a purified protein that has an amino acid sequence 

U essentially as set forth by the amino acid sequence of SEQ ID N0:2 or SEQ ID N0:4 are 
also encompassed by the invention. 

2 The TGF-|8s represent a family of strucmrally related molecules with diverse effects 

on mammalian cell shape, growth, and differentiation (Roberts and Spom, 1990). Initially 
synthesized as a precursor consisting of an amino-terminal propeptide followed by mature 
TGF-/3, two chains of nascent pro-TGF-|8 associate in most tissues to form a Mr ~ 106,000 
25 inactive disulfide-bonded dimer. Homodimers are most conunon, but heterodimers have also 
been described (Cheifetz et al., 1987; Ogawa et al., 1992). During biosynthesis the mature 
TGF-|8 dimer is cleaved from the propeptide dimer. TGF-/3 latency results in part from the 
non-covalent association of propeptide and mature TGF-i3 dimers (Pircher et al., 1984, 1986; 
Wakefield et al., 1987; Millan et al., 1992; Miyazono and Heldin, 1989). Consequently, the 
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propeptide dimer is often referred to as the latency associated protein (LAP), and LAP plus 
the disulfide-bonded TGF-|3 dimer are also known as the small latent complex. In the 
extracellular space small latent complexes must be dissociated to activate mamre TGF-/3. 
The mechanism of activation of die latent complex is thought to be one of the most important 
5 steps governing TGF-)3 effects (Lyons et al., 1988; Antonelli-Orlidge et ai, 1989; Twardzik 
etal, 1990; Szio et al, 1993). 

In certain Imes of culhired cells small latent growth factor complexes may contain 
additional high molecular weight proteins. The best characterized of these high molecular 
10 weight proteins is the latent TGF-|3 binding protein, or LTBP (Miyazono et al., 1988; 
Kanzaki et ai, 1990; Tsuji et ai, 1990; Olofsson et ai, 1992; Taketazu et ai. 1994). 
LTBP produced by different cell types is heterogeneous in size, perhaps because of 
i alternative splicing or because of tissue-specific proteolytic processing (Miyazono et al., 
S 1988- Wakefield et al.. 1988; Kanzaki et ai, 1990; Tsuji et al.. 1990). Latent TGF-/3 
B complexes that contain LTBP are known as large latent complexes. LTBP has no known 
covalent linkage to matore TGF-j3, but rather it is linked by a disulfide bond to LAP. 
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□ Regarding the novel protein LTBP-2 or LTBP-3, die present invention concerns DNA 

f! segments, that can be isolated from virtually any mammalian source, that are free from total 
m genomic DNA and that encode proteins having LrBP-2-like or LTBP-3-like activity. DNA 
O segments encoding LTBP-2-like or LTBP-3-like species may prove to encode proteins, 
polypeptides, subunits, functional domains, and the like. 

As used herein, the term "DNA segment" refers to a DNA molecule that has been 
25 isolated free of total genomic DNA of a particular species. Therefore, a DNA segment 
encoaing LTBP-2 or LTBP-3 refers to a DNA segment that contains LTBP-2 or LTBP-3 
coding sequences yet is isolated away from, or purified free from, total genomic DNA of the 
species from which the DNA segment is obtained. Included within the term "DNA 
segment", are DNA segments and smaller fragments of such segments, and also recombinant 
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vectors, including, for example, plasmids, cosmids, phagemids, phage, viruses, and the like. 

Similarly, a DNA segment comprising an isolated or purified LTBP-2 or LTBP-3 
gene refers to a DNA segment including LTBP-2 or LTBP-3 coding sequences and, in 
certain aspects, regulatory sequences, isolated substantially away from other naturally 
occurring genes or protein encoding sequences. In this respect, the term "gene" is used for 
simplicity to refer to a functional protein, polypeptide or peptide encoding unit. As will be 
understood by those in the art, this functional term includes both genomic sequences, cDNA 
sequences and smaller engineered gene segments that express, or may be adapted to express, 
proteins, polypeptides or peptides. 

"Isolated substantially away from other coding sequences" means that the gene of 
interest, in this case, a gene encoding LTBP-2 or LTBP-3, forms the significant part of the 
coding region of the DNA segment, and that the DNA segment does not contain large 
portions of naturally-occurring coding DNA, such as large chromosomal fragments or other 
functional genes or cDNA coding regions. Of course, this refers to the DNA segment as 
originally isolated, and does not exclude genes or coding regions later added to the segment 
by the hand of man. 

In particular embodiments, the invention concerns isolated DNA segments and 
recombinant vectors incorporating DNA sequences that encode an LTBP-2 or LTBP-3 
species that includes within its amino acid sequence an amino acid sequence essentially as set 
forth in SEQ ID N0:2 or SEQ ID N0:4. In other particular embodiments, the invention 
concerns isolated DNA segments and recombinant vectors incorporating DNA sequences that 
include within their sequence a nucleotide sequence essentially as set forth in SEQ ID N0:1 
or SEQ ID N0:3. 

The term "a sequence essentially as set forth in SEQ ID NO:2 or SEQ ID N0:4" 
means that the sequence substantially corresponds to a portion of SEQ ID N0:2 or SEQ ID 
N0:4 and has relatively few amino acids that are not identical to, or a biologically functional 



equivalent of, the amino acids of SEQ ID N0:2 or SEQ ID N0:4. The term "biologically 
functional equivalent" is well understood in the art and is further defined in detail herein (for 
example, see section 7, preferred embodiments). Accordingly, sequences that have between 
about 70% and about 80%; or more preferably, between about 81% and about 90%; or even 
5 more preferably, between about 91% and about 99%; of amino acids tfiat are identical or 
functionally equivalent to the amino acids of SEQ ID N0:2 or SEQ ID N0:4 will be 
sequences that are "essentially as set forth in SEQ ID N0:2 or SEQ ID N0:4". 

In certain other embodiments, the invention concerns isolated DNA segments and 
10 recombinant vectors that include within their sequence a nucleic acid sequence essentially as 
set forth in SEQ ID N0:1 or SEQ ID NO: 3. The term "essentially as set forth in SEQ ID 
N0:1 or SEQ ID NO: 3" is used in the same sense as described above and means that the 
nucleic acid sequence substantially corresponds to a portion of SEQ ID N0:1 or SEQ ID 
Ji N0:3 and has relatively few codons that are not identical, or functionally equivalent, to the 
|b codons of SEQ ID N0:1 or SEQ ID N0:3. Again, DNA segments that encode protems 
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exhibiting LTBP-2-like or LTBP-3-like activity will be most preferred. 



%. It will also be understood that amino acid and nucleic acid sequences may include 

6i additional residues, such as additional N- or C-terminal amino acids or 5' or 3' sequences, 
fl20 and yet still be essentially as set forth in one of the sequences disclosed herein, so long as 
y the sequence meets the criteria set forth above, mchiding the maintenance of biological 

protein activity where protein expression is concemed. The addition of terminal sequences 
particularly applies to nucleic acid sequences that may, for example, include various non- 
coding sequences flanking either of the 5' or 3' portions of the coding region or may include 
25 various internal sequences, Le. , introns, which are known to occur within genes. 

Naturally, the present invention also encompasses DNA segments that are 
complementary, or essentially complementary, to the sequence set forth in SEQ ID NO:l or 
SEQ ID N0:3. Nucleic acid sequences that are "complementary" are those that are capable 
30 of base-pairing according to the standard Watson-Crick complementarity rules. As used 
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herein, the term "complementary sequences" means nucleic acid sequences that are 
substantially complementary, as may be assessed by the same nucleotide comparison set forth 
above, or as defined as being capable of hybridizing to the nucleic acid segment of SEQ ID 
N0:1 or SEQ ID N0:3, under relatively stringent conditions such as those described herein. 

The nucleic acid segments of the present invention, regardless of the length of the 
coding sequence itself, may be combined with other DNA sequences, such as promoters, 
polyadenylation signals, additional restriction enzyme sites, multiple cloning sites, other 
coding segments, and the like, such that their overall length may vary considerably. It is 
therefore contemplated that a nucleic acid fragment of aknost any length may be employed, 
with the total length preferably being limited by the ease of preparation and use in the 
intended recombinant DNA protocol. For example, nucleic acid fragments may be prepared 
that include a short contiguous stretch identical to or complementary to SEQ ID N0:1 or 
SEQ ID NO: 3, such as about 14 nucleotides, and that are up to about 10,000 or about 5,000 
base pairs in length, with segments of about 3,000 being preferred in certain cases. DNA 
segments with total lengths of about 1,000, about 500, about 200, about 100 and about 50 
base pairs in length (including all intermediate lengths) are also contemplated to be useful. 

It will be readily understood that "mtermediate lengths", in these contexts, means any 
length between the quoted ranges, such as 14, 15, 16, 17, 18, 19, 20, etc.; 21, 22, 23, etc; 
30, 31, 32, etc.; 50, 51, 52, 53, etc; 100, 101, 102, 103, etc; 150, 151, 152, 153, etc; 
including all integers through the 200-500; 500-1,000; 1,000-2,000; 2,000-3,000; 3,000- 
5,000; 5,000-10,000 ranges, up to and including sequences of about 12,001, 12,002, 13,001, 
13,002 and the like. 

It will also be understood that this invention is not limited to the particular nucleic 
acid and amino acid sequences of SEQ ID N0:1 or SEQ ID N0:3; and SEQ ID N0:2 or 
SEQ ID NO:4, respectfully. Recombinant vectors and isolated DNA segments may therefore 
variously include the LTBP-2 or LTBP-3 coding regions themselves, coding regions bearing 
selected alterations or modifications in the basic coding region, or they may encode larger 
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polypeptides that nevertheless include LTBP-2 or LTBP-3-coding regions or may encode 
biologically functional equivalent proteins or peptides that have variant amino acids 
sequences. 

5 The DNA segments of the present invention encompass biologically functional 

equivalent LTBP-2 or LTBP-3 proteins and peptides. Such sequences may arise as a 
consequence of codon redundancy and functional equivalency that are known to occur 
naturally within nucleic acid sequences and the proteins thus encoded. Alternatively, 
functionally equivalent proteins or peptides may be created via the application of recombinant 

10 DNA technology, in which changes in the protein strucmre may be engineered, based on 
considerations of the properties of the amino acids being exchanged. Changes designed by 
man may be introduced through the application of site-directed mutagenesis techniques, e.g. , 

O to introduce unprovements to the antigenicity of the protein or to test mutants in order to 

^ examine activity at the molecular level. 

35 

W If desired, one may also prepare fusion proteins and peptides, e.g. , where the LTBP-2 

J or LTBP-3 coding regions are aligned within the same expression unit with other proteins or 
L peptides having desired functions, such as for purification or immunodetection purposes 
(e.g. , proteins that may be purified by affinity chromatography and enzyme label coding 

rgO regions, respectively). 

O 

^ Recombinant vectors form further aspects of the present invention. Particularly useful 

vectors are contemplated to be those vectors in which the coding portion of the DNA 
segment, whether encoding a full length protein or smaller peptide, is positioned under the 

25 control of a promoter. The promoter may be in the form of the promoter that is naturally 
associated with a LTBP-2 or LTBP-3 gene, as may be obtained by isolating the 5' non- 
coding sequences located upstream of the coding segment or exon, for example, using 
recombinant cloning and/or PGR™ technology, in connection with the compositions disclosed 
herein. 
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In other embodiments, it is contemplated that certain advantages will be gained by 
positioning the coding DNA segment under the control of a recombinant, or heterologous, 
promoter. As used herein, a recombinant or heterologous promoter is intended to refer to a 
promoter that is not normally associated with an LTBP-2 or LTBP-3 gene in its natural 
5 environment. Such promoters may include LTBP-2 or LTBP-3 promoters normally 

associated with other genes, and/or promoters isolated from any bacterial, viral, eukaryotic, 
or mammaliar cell. Naturally, it will be important to employ a promoter that effectively 
directs the expression of the DNA segment in the cell type, organism, or even animal, 
chosen for expression. The use of promoter and cell type combinations for protein 
10 expression is generally known to those of skill in the art of molecular biology, for example, 
see Sambrook et al, 1989. The promoters employed may be constitutive, or inducible, and 
can be used under the appropriate conditions to direct high level expression of the introduced 
O DNA segment, such as is advantageous in the large-scale production of recombinant proteins 
in or peptides. Appropriate promoter systems contemplated for use in high-level expression 
%5 include, but are not limited to, the Pichia expression vector system (Pharmacia LKB 
J Biotechnology), 
ffi _ , _ 

%. In connection with expression embodiments to prepare recombinant LTBP-2 or LTBP- 

^' 3 proteins and peptides, it is contemplated that longer DNA segments will most often be 
rgO used, with DNA segments encoding the entire LTBP-2 or LTBP-3 protein or functional 
y domains, subunits, etc. being most preferred. However, it will be appreciated that the use of 
shorter DNA segments to direct the expression of LTBP-2 or LTBP-3 peptides or epitopic 
core regions, such as may be used to generate anti- LTBP-2 or LTBP-3 antibodies, also falls 
within the scope of the invention. DNA segments that encode peptide antigens from about 
25 15 to about 50 amino acids in length, or more preferably, from about 15 to about 30 amino 
acids in length are contemplated to be particularly useful. 

The LTBP-2 or LTBP-3 gene and DNA segments may also be used in connection 
with somatic expression in an animal or in the creation of a transgenic animal. Again, in 
30 such embodiments, the use of a recombinant vector that directs the expression of the full 
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length or active LTBP-2 or LTBP-3 protein is particularly contemplated. 

In addition to their use in directing the expression of the LTBP-2 or LTBP-3 protein, 
the nucleic acid sequences disclosed herein also have a variety of other uses. For example, 
they also have utility as probes or primers in nucleic acid hybridization embodiments. As 
such, it is contemplated that nucleic acid segments that comprise a sequence region that 
consists of at least a 14 nucleotide long contiguous sequence that has the s^me sequence as, 
or is complementary to, a 14 nucleotide long contiguous sequence of SEQ ID N0:1 or SEQ 
ID N0:3 will find particular utility. Longer contiguous identical or complementary 
sequences, e.g., those of about 20, 30, 40, 50, 100, 200, 500, 1000 (including all 
intermediate lengths) and even up to full length sequences will also be of use in certain 
embodiments. 

The ability of such nucleic acid probes to specifically hybridize to LTBP-2 or LTBP- 
3-encoding sequences will enable them to be of use in detecting the presence of 
complementary sequences in a given sample. However, other uses are envisioned, including 
the use of the sequence information for the preparation of mutant species primers, or primers 
for use in preparing other genetic constructions. 

Nucleic acid molecules having sequence regions consisting of contiguous nucleotide 
stretches of 10-14, 15-20, 30, 50, or even of 100-200 nucleotides or so, identical or 
complementary to SEQ ID N0:1 or SEQ ID N0:3, are particularly contemplated as 
hybridization probes for use in, e,g. , Southern and Northern blotting. This would allow 
LTBP-2 or LTBP-3 structural or regulatory genes to be analyzed, both in diverse cell types 
and also in various manmialian cells. The total size of fragment, as well as the size of the 
complementary stretch(es), will ultimately depend on the intended use or application of the 
particular nucleic acid segment. Smaller fragments will generally find use in hybridization 
embodiments, wherein the length of die contiguous complementary region may be varied, 
such as between about 10-14 and about 100 nucleotides, but larger contiguous 
complementarity stretches may be used, according to the length complementary sequences 
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one wishes to detect. 



The use of a hybridization probe of about 10-14 nucleotides in length allows the 
formation of a duplex molecule that is both stable and selective. Molecules having 
contiguous complementary sequences over stretches greater than 10 bases in length are 
generally preferred, though, in order to increase stability and selectivity of the hybrid, and 
therebv improve the quality and degree of specific hybrid molecules obtained. One will 
generally prefer to design nucleic acid molecules having gene-complementary stretches of 15 
to 20 contiguous nucleotides, or even longer where desired. 

Hybridization probes may be selected from any portion of any of the sequences 
disclosed herein. All that is required is to review the sequence set forth in SEQ ID N0:1 or 
SEQ ID N0:3 and to select any continuous portion of the sequence, from about 10-14 
nucleotides in length up to and including the tun length sequence, that one wishes to utilize 
as a probe or primer. The choice of probe and primer sequences may be governed by 
various factors, such as, by way of example only, one may wish to employ primers from 
towards the termini of the total sequence. 

The process of selecting and preparing a nucleic acid segment that includes a 
contiguous sequence from within SEQ ID N0:1 or SEQ ID NO: 3 may alternatively be 
described as preparing a nucleic acid fragment. Of course, fragments may also be obtained 
by other techniques such as, e.g., by mechanical shearing or by restriction enzyme digestion. 
Small nucleic acid segments or fragments may be readily prepared by, for example, directly 
synthesizing the fragment by chemical means, as is commonly practiced using an automated 
oligonucleotide synthesizer. Also, fragments may be obtained by application of nucleic acid 
reproduction technology, such as the PGR™ technology of U.S. Patent 4,603,102 
(incorporated herein by reference), by introducing selected sequences into recombinant 
vectors for recombinant production, and by other recombinant DNA techniques generally 
known to those of skill in the art of molecular biology. 
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Accordingly, the nucleotide sequences of the invention may be used for their ability to 
selectively form duplex molecules witii complementary stiretches of LTBP-2 or LTBP-3 gene 
or cDNA fragments. Depending on the application envisioned, one will desire to employ 
varying conditions of hybridization to achieve varying degrees of selectivity of probe towards 
target sequence. For applications requiring high selectivity, one will typically desire to 
employ relatively stringent conditions to form' the hybrids, e.g., one will select relatively low 
salt and/or high ^emperatiire conditions, such as provided by about 0.02 M to about 0.15 M 
NaCl at temperatures of 50°C to 70°C. Such selective conditions tolerate little, if any, 
mismatch between the probe and the template or target sti-and, and would be particularly 
suitable for isolating LTBP-2 or LTBP-3 genes. 

Of course, for some applications, for example, where one desires to prepare mutants 
employing a mutant primer strand hybridized to an underlying template or where one seeks 
to isolate LTBP-2 or LTBP-2 or LTBP-3-encodiQg sequences from related species, functional 
equivalents, or tiie like, less stringent hybridization conditions will typically be needed in 
order to allow formation of the heteroduplex. In these circumstances, one may desire to 
employ conditions such as about 0.15 M to about 0.9 M salt, at temperamres ranging from 
20°C to 55 °C. Cross-hybridizing species can thereby be readily identified as positively 
hybridizing signals witii respect to control hybridizations. In any case, it is generally 
appreciated tiiat conditions can be rendered more stringent by the addition of increasing 
amounts of formamide, which serves to destabilize the hybrid duplex in the same manner as 
increased temperature. Thus, hybridization conditions can be readily manipulated, and thus 
will generally be a method of choice depending on tiie desired results. 

In certain embodiments, it will be advantageous to employ nucleic acid sequences of 
the present invention in combination with an appropriate means, such as a label, for 
determining hybridization. A wide variety of appropriate indicator means are known in tiie 
art, including fluorescent, radioactive, enzymatic or other ligands, such as avidin/biotin, 
which are capable of giving a detectable signal. In preferred embodiments, one will likely 
desire to employ a fluorescent label or an enzyme tag, such as urease, alkaline phosphatase 
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or peroxidase, instead of radioactive or other environmental undesirable reagents. In the 
case of enzyme tags, colorimetric indicator substrates are known that can be employed to 
provide a means visible to the human eye or spectrophotometrically, to identify specific 
hybridization v^ith complementary nucleic acid-containing samples. 

In general, it is envisioned that the hybridization probes described herein will be 
useful both as reagents in solution hybridization as well as in embodiments employing a solid 
phase. In embodiments involving a solid phase, the test DNA (or RNA) is adsorbed or 
otherwise affixed to a selected matrix or surface. This fixed, single-stranded nucleic acid is 
then subjected to specific hybridization with selected probes under desired conditions. The 
selected conditions will depend on the particular circumstances based on the particular 
criteria required (depending, for example, on the G+C content, type of target nucleic acid, 
source of nucleic acid, size of hybridization probe, etc). Following washing of the 
hybridized surface so as to remove nonspecifically bound probe molecules, specific 
hybridization is detected, or even quantitated, by means of the label. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The drawings form part of the present specification and are included to further 
demonstrate certain aspects of the present invention. The mvention may be better understood 
by reference to one or more of these drawings in combination with the detailed description of 
specific embodunents presented herein. 

FIG. 1. Overlapping murine cDNA clones representing the LTBP-like (LTBP-2 or 
LTBP-3) sequence. A partial representation of restriction sites is shown. N, Ncol\ P, PvwII; 
R, Rsall\ B, BamHI; H, /fmdIII. The numbering system at the bottom assumes that the "A" 
of the initiator Met codon is nt #1 . 

FIG. 2A. A schematic showing the structure of the murine fibrillin-1 gene product. 
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Structural domains are shown below the diagram. Symbols designating various structural 
elements are defined in the legend to FIG. 2B. 

FIG. 2B. A schematic showing the structure of the murine LTBP-like (LTBP-2 or 
LTBP-3) molecule. Domains #1-5 are denoted below the diagram. Symbols designate the 
following structural elements: EGF-CB repeats: open rectangles; TGF-bp repeats: open 
ovals; Fib motif: open circle; TGF-bp-like repeat: patterned oval; cysteine-rich sequences: 
patterned rectangles; proline/glycine-rich region: thick curved Ime, domain #2; proline-rich 
region, thick curved line, domain #3. Note that symbols designating the signal peptide have 
been deleted for simplicity. AdditionaUy, the schematic assumes that EGF-like and EGF-CB 
repeats may extend for several amino acids beyond the Cg position. 

FIG. 2C. A schematic showing the structure of human LTBP-1. Domains #1-5 are 
denoted below the diagram. The symbols designating the structural elements are defined in 
the legend to FIG. 2B. 

FIG. 3A, FIG. 3B, FIG. 3C, FIG. 3D, FIG. 3E and FIG. 3F. Overview of 
expression of the new LTBP-like (LTBP-2 or LTBP-3) gene during murine development as 
determined by tissue in situ hybridization. FIG. 3A. FIG. 3B, FIG. 3C, FIG. 3D, FIG. 3E 
and FIG. 3F are autoradiograms made by direct exposure of tissue sections to fihn after 
hybridization with radiolabeled probes. Identical conditions were maintained throughout 
autoradiography and photography, thereby allowing a comparison of the overall strength of 
hybridization in all tissue sections. The transcript is expressed in connective tissue, 
mesenchyme, liver, heart and CNS. 

FIG. 3A. Day 8.5-9.0; sections contain embryos surrounded by intact membranes, 
uterine tissues, and the placental disk, cut in random planes; anti-sense probe. 

FIG. 3B. Day 8.5-9.0; sections contain embryos surrounded by intact membranes, 
uterine tissues, and the placental disk, cut in random planes; sense probe. 
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FIG. 3C, Day 13.5; sections contain isolated whole embryos sectioned in the sagittal 
plane near or about the mid-line; anti-sense probe. 

FIG. 3D. Day 13.5; sections contain isolated whole embryos sectioned m the sagittal 
5 plane near or about the mid-line; sense probe. 

FIG. 3E. Day 16.5; sections contain isolated whole embryos sectioned in the sagittal 
plane near or about the mid-line, anti-sense probe. 

10 FIG. 3F. Day 16.5; sections contain isolated whole embryos sectioned in the sagittal 

plane near or about the mid-line, sense probe. 

0 FIG. 4. Time-dependent expression of the LTBP-3 gene by MC3T3-E1 cells. 

|fl mRNA preparation and Northern blotting were preformed as described in Example XIV. 
SI5 Equal aliquots of total RNA as determined by UV spectroscopy were loaded in each lane of 

is Is? 

the Northern gel. As demonstrated by UV spectroscopy were loaded in each lane of the 
[ft Northem gel. As demonstrated by methylene blue staining (Sambrook et al , 1989), equal 
% amounts of RNA were transferred to the nylon membrane. The results demonstrate a clear, 

01 strong peak in LTBP-3 gene expression by 14 days in culture. Weaker signals denoting 
|§0 LTBP-3 gene expression also can be observed after 5 days and 28 days in culture. 

FIG. 5. Antisera #274 specifically binds LTBP-3 epitopes. Transfection of 293T 
cells with a fiill length mouse LTBP-3 expression plasmid followed by radiolabeling, 
preparation of medium sample, immunoprecipitation, and 4-18% gradient SDS-PAGE were 
25 performed as described in Example XIV. The figure presents a SDS-PAGE autoradiogram 
of medium samples following a 2 day exposure to fihn. Lane assignments are as follows: 
Lane 1, radiolabeled 293T medium (prior to transfection) immunoprecipitated with 
preimmune serum; Land 2, radiolabeled 293T medium (prior to transfection) 
immunoprecipitated with antibody #274; Lane 3, radiolabeled 393T medium (following 
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transfection and preincubation with 10 /xg of LTBP-3 synthetic peptide cocktail) 
immunoprecipitated with antibody #274; and Lane 4, radiolabeled 293T medium (following 
transfection) inununoprecipitated with antibody #274. As indicated by the bar, the fall length 
LTBP-3 molecule migrated at 180-190 kDa. 

FIG. 6. Co-immunoprecipitation of LTBP-3 and TGR-jSl produced by MC3T3-E1 
cells. Aliquots ( — 10^ incorporated CPM) of radiolabeled media produced by MC3T3-E1 
cells after 7 days in culture were inmiunoprecipitated as described in Example XIV. Bars 
indicate the position of cold molecular weight standards used to estimate molecular weight 
(Rainbow mix, Amersham). Immunoprecipitates were separated using 4%-18% gradient 
SDS-PAGE and reducing conditions. The figure shows a negative control lane 1 consisting 
of MC3T3-E1 medium immunoprecipitated with anti-LTBP-3 antibody #274. Western 
blotting was performed using the lower portion of the gradient gel and a commercially 
available antibody to TGF-jSl (Santa Cruz Biotechnology, Inc.). Antibody staining was 
detected using commercially available reagents and protocols (ECL Western Blotting 
Reagent, Amersham). MC3T3-E1 medium was immunoprecipitated with anti-LTBP-2 
antibody #274. 

FIG. 7. DNA sequenci of the murine LTBP-2 gene (SEQ ID N0:1). 

FIG. 8. Amino acid s^r^uSSEe^the murine LTBP-2 gene product (SEQ ID NO:2). 

FIG. 9. DNA sequence oMh^^ LTBP-3 gene (SEQ ID NO: 3). 

FIG. 10. Amino acid sequence oVthe murine LTBP-3 gene product (SEQ ID N0:4). 

X^^^^Z FIG. II. Mouse ltbp-\cDNA Clones. The schematic figure presents overlapping 
^ouse cDNA clones representing^ the mouse ltbp-2 coding sequence. A partial representation 
of the restriction sites is shown. \, Avrll] N, Nael\ Sa, Scall; X, Xhol; B, BamHI; C, C/al; 
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and E,^coKl. 

FIG. 12A. Itb^^ Gene Expression in the Developing Mouse Skeleton. Shown is an 
overview of the of the gene during mouse development, as determine by tissue in situ 
hybridization. The figure ptesents an autoradiogram made by direct exposure of tissue 
sections to fihn after hybridization with radiolabeled probes, but before dipping slides in 
radiographic ^-mulsion. Day 16\ p.c. sections contain whole embryos sectioned in the mid- 
sagittal plane. Identical conditionsVere maintained throughout autoradiography and 
photography, making it possible to c^pare the overall strength of hybridization with 
antisense (top) and sense (bottom) prob^ 1 cm = 20 ^m. 



FIG. 12B. ltbp-2 Gene Expression m the Developing Mouse Skeleton. Shown is a 
selected brightfield microscopic view of mouses/fZ?p-2 gene expression in cartilage of day 
16.5 p.c. mouse embryos. Photograph was takeXfrom tissue sections following a two week 
exposure to photographic emulsion. 1 cm = 20 /nr 

FIG. lie. ltbp-2 Gene Expression in the Devellsroing Mouse Skeleton. Shown is a 
selected darkfield microscopic view of mouse Uhp-2 gene expression in cartilage of day 16.5 
p.c. mouse embryos. Photograph was taken from tissue sectfsms following a two week 
exposure to photographic emulsion. In all darkfield photograph^ed blood cell and other 
plasma membranes give a faint white signal that contributes to the\ackground of the smdy. 
1 cm = 20 /im. 



FIG. X^. Co-dlansfection of 293T Cells With pLTBP-3fl and pTGF-/31. 
Immunoprecipitation of lSTBP-3 and TGF-/31 produced by 293T cells following transient 



transfection and radiolabeli 



Aliquots (-10* incorporated CPM) of radiolabeled media 
were immunoprecipitated and'^^raf^ed using 4%-18% gradient SDS-PAGE and either 



reducing or nonreducing conditi 
the position of cold standarc 



described (Yin et al., 1995). Bars on the left indicate 
't\estimate molecular weight: 200, 97.4, 69, 46, 30, 
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21.5 and 14.3 kDa (RaiAow mix, Amersham). Lane assignments are as follows: Lane 1, 
293T cell medium proteinA immunoprecipitated with 274 antibody (separated under reducing 
conditions) after co-transfe\tion with pLTBP-3fl and pTGF-^1; Lane 2, 293T cell medium 
proteins immunoprecipitatedWith 274 antibody (separated under nonreducing conditions) 
after co-transfection with pLTIBP-3fl and pTGF-|81; Lane 3, untransfected 293T cell medium 
proteins immunoprecipitated w\th 274 antibody (separated under reducing conditions); Lane 
4, 293T c«ll medium protein? idununoprecipitated v ith 274 antibody (sep?-ated under 
reducing conditions) after transfeVtion with pLTBP-3fl; Lane 5, 293T ceU medium proteins 
immunoprecipitated with 4()0,?lr-afttibDd}s<5raarated under reducing conditions) after 
transfection with pTGF-/31; Lan^dL 293T ceh medium proteins immunoprecipitated with 
40091 antibody (separated under hobreducmg conditions) after transfection with pTGF-/31; 
and Lane 7, 293T cell medium pr^fai^mmiunoprecipitated with 40091 antibody (separated 
under reducing conditions) after co-trksfection with pLTBP-3fl and pTGF-^1. Note that the 
signal is weakest in lanes 5-7, in whictt proteins were unmunoprecipitated using the 40091 
antibody, reflecting the weaker affmity if the 40091 antiserum. 

DESCRTPTION OF THE PREFERRED EMBODIMENT 
1. Applications of Bone Repair Technology to Human Treatment 

The following is a brief discussion of four human conditions to exemplify the variety 
of diseases and disorders that would benefit from the development of new technology to 
improve bone repau- and healing processes. In addition to the following, several other 
conditions, such as,' for example, vitamin D deficiency; wound healing in general; soft 
skeletal tissue repair; and cartilage and tendon repair and regeneration, may also benefit from 
technology concerning the stimulation of bone progenitor cells. 

The furst example is the otherwise healthy individual who suffers a frachire. Often, 
clinical bone fracture is treated by casting to alleviate pain and allow natural repair 
mechanisms to repair the wound. While there has been progress in the treatment of fracmre 
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in recent times, even without considering the various complications that may arise in treatmg 
fractured bones, any new procedures to increase bone healing in normal circumstances would 
represent a great advance. 

A second example which may benefit from new treatment methods is osteogenesis 
imperfecta (01). 01 encompasses various inherited connective tissue diseases that involve 
bone ?nd soft connective tissue fragility in humans (Byers and Sterner, 1992; Prockop, 
1990). About one child per 5,000-20,000 bom is affected with 01 and the disease is 
associated with significant morbidity throughout life. A certain number of deaths also occur, 
resulting in part from the high propensity for bone fracture and the deformation of abnormal 
bone after fracture repair (01 types D-IV; Bonadio and Goldstein, 1993). The relevant issue 
here is quality of life; clearly, the lives of affected individuals would be improved by the 
development of new tiierapies designed to stimulate and strengthen the fracture repair 
process. 

01 type I is a mild disorder characterized by bone fracture wittiout deformity, blue 
sclerae, normal or near normal stature, and autosomal dominant inheritance (Bonadio and 
Goldstein, 1993). Osteopenia is associated with an increased rate of lone bone fracture upon 
ambulation (the fracture frequency decreases dramatically at puberty and during young adult 
life, but increases once again in late middle age). Hearing loss, which often begins in the 
second or third decade, is a feature of this disease in about half the families and can progress 
despite the general decline in fracture frequency. Dentinogenesis imperfecta is observed in a 
subset of individuals. 

In contrast, 01 types 11- VI represent a spectrum of more severe disorders associated 
with a shortened life-span. 01 type II, the perinatal lethal form, is characterized by short 
stature, a soft calvarium, blue sclerae, fragile skin, a small chest, floppy appearing lower 
extremities (due to external rotation and abduction of the femurs), fragile tendons and 
ligaments, bone fracture with severe deformity, and death in the perinatal period due to 
respiratory insufficiency. Radiographic signs of bone weakness include compression of the 
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femurs, bowing of the tibiae, broad and beaded ribs, and calvarial thinning. 

01 type in is characterized by short stature, a triangular facies, severe scoliosis, and 
bone fracture with moderate deformity. Scoliosis can lead to emphysema and a shortened 
5 life-span due to respiratory msufficiency. 01 type IV is characterized by normal sclerae, 
bone fracture with mild to moderate deformity, tooth defects, and a namral history that 
essentially is intermediate between 01 type 11 and 01 type I. 

More than 200 01 mutations have been characterized since 1989 (reviewed in Byers 
10 and Steiner, 1992; Prockop, 1990). The vast majority occur in the COLlAl and C0L1A2 
genes of type I collagen. Most cases of 01 type I appear to result from heterozygous 
mutations in the COLlAl gene that decrease collagen production but do not alter primary 
y structure, i.e., heterozygous null mutations affecting COLlAl expression. Most cases of 01 
Ul types II-IV result from heterozygous mutations in the COLlAl and C0L1A2 genes that alter 

fijlS the structure of collagen. 

ffi 

m 

iff A third important example is osteoporosis. The term osteoporosis refers to a 

Q heterogeneous group of disorders characterized by decreased bone mass and fractures. An 

f} estimated 20-25 million people are at increased risk for fracture because of site-specific bone 
ry20 loss. Risk factors for osteoporosis include increasing age, gender (more females), low bone 
□ mass, early menopause, race (Caucasians), low calcium intake, reduced physical activity, 

genetic factors, environmental factors (including cigarette smoking and abuse of alcohol or 
caffeine), and deficiencies in neuromuscular control that create a propensity to fall. 

25 More than a million fractures in the USA each year can be attributed to osteoporosis, 

and in 1986 alone the treatment of osteoporosis cost an estimated 7-10 billion health care 
dollars. Demographic trends (i.e., the gradually increasing age of the US population) 
suggest that these costs may increase 2-3 fold by the year 2020 if a safe and effective 
treatment is not found. Clearly, osteoporosis is a significant health care problem. 

30 
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Clinically, osteoporosis is segregated into type I and type 11 . Type I osteoporosis 
occurs predominantly in middle aged women and is associated with estrogen loss at the 
menopause, while osteoporosis type II is associated with advancing age. Much of the 
morbidity and mortality associated with osteoporosis results from immobilization of elderly 
patients following fracture. 

Current therapies for osteoporosis patients focus on fracture prevention, not fracture 
repair. This remains an important consideration because of the literature, which clearly 
states that significant morbidity and mortality are associated with prolonged bed rest in the 
elderly, particularly those who have suffered hip fractures. Complications of bed rest 
include blood clots and pneumonia. These complications are recognized and measures are 
usually taken to avoid them, but these measures hardly represent the best approach to 
therapy. Thus, the osteoporotic patient population would benefit from new therapies 
designed to strengthen bone and speed up the fracture repair process, thereby getting these 
people on their feet before the complications arise. 

A fourth example is related to bone reconstruction and, specifically, the ability to 
reconstruct defects in bone tissue that result from traumatic injury; cancer or cancer surgery; 
birth defect; a developmental error or heritable disorder; or aging. There is a significant 
orthopaedic need for more stable total joint hnplunts, and cranial and facial bone are 
particular targets for this type of reconstructive need. The availability of new implant 
materials, e.g, , titanium, has permitted the repair of relatively large defects. Titanium 
implants provide excellent temporary stability across bony defects. However, experience has 
shown that a lack of viable bone bridging the defect can result in exposure of the appliance, 
infection, structural instability and, ultimately, failure to repair the defect. 

Autologous bone grafts are another possible reconstructive modality, but they have 
several demonstrated disadvantages in that they must be harvested from a donor site such as 
iliac crest or rib, they usually provide insufficient bone to completely fill the defect, and the 
bone that does form is sometimes prone to infection and resorption. Partially purified 
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xenogeneic preparations are not practical for clinical use because microgram quantities are 
purified from kilograms of bovine bone, making large scale commercial production both 
costly and impractical. Allografts and demineralized bone preparations are therefore often 
employed. 

Microsurgical transfers of free bone grafts with attached soft tissue and blood vessels 
can close bony defects with an immediate source of blood supply to the graft. However, 
these techniques are time consuming, have been shown to produce a great deal of morbidity, 
and can only be used by specially trained individuals. Furthermore, the bone implant is 
often limited in quantity and is not readily contoured. In the mandible, for example, the 
majority of patients cannot wear dental appliances using presently accepted techniques (even 
after continuity is established), and thus gain little improvement in the ability to masticate. 
Toriumi et al. , have written that, "reconstructive surgeons should have at their disposal a 
bone substitute that would be reliable, biocompatible, easy to use, and long lasting and that 
would restore mandibular continuity with little associated morbidity." 

In connection with bone reconstruction, specific problem areas for improvement are 
those concerned with treating large defects, such as created by trauma, birth defects, or 
particularly, following tumor resection. The success of orthopaedic implants, interfaces and 
artificial joints could conceivably be improved if the surface of the implant, or a functional 
part of an implant, were to be coated with a bone stimulatory agent. The surface of implants 
could be coated with one or more appropriate materials in order to promote a more effective 
interaction with the biological site surrounding the implant and, ideally, to promote tissue 
repair. 

2. Bone Repair 

Bone tissue is known to have the capacity for repair and regeneration and there is a 
certain understanding of the cellular and molecular basis of these processes. The initiation of 
new bone formation involves the commitment, clonal expansion, and differentiation of 
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progenitor cells. Once initiated, bone formation is promoted by a variety of polypeptide 
growth factors, Newly formed bone is then maintained by a series of local and systemic 
growth and differentiation factors. 

The concept of specific bone growth-promoting agents is derived from the work of 
Huggins and Urist. Huggms et al. , 1936, demonstrated that autologous transplantation of 
canine incisor tooth to skeletal muscle resulted in local new bone formation (Huggins et al., 
1936). Urist and colleagues reported that demineralized lyophilized bone segments induced 
bone formation (Urist, 1965; Urist et al., 1983), a process that involved macrophage 
chemotaxis; the recruitment of progenitor cells; the formation of granulation tissue, cartilage, 
and bone; bone remodeling; and marrow differentiation. The initiation of cartilage and bone 
formation in an extraskeletal site, a process referred to as osteoinduction, has permitted the 
unequivocal identification of initiators of bone morphogenesis (Urist, 1965; Urist et al. , 
1983; Sampath et al., 1984; Wang et al., 1990; Cunningham et al., 1992). 

Significant progress has now been made in characterizing the biological agents 
elaborated by active bone tissue during growth and natural bone healing. Demineralized 
bone matrix is highly insoluble; Sampath and Reddi (1981) showed that only 3% of the 
proteins can be extracted using strong combinations of denaturants and detergents. They also 
showed that the unfractionated demineralized bone extract will mitiate bone morphogenesis, a 
critical observation that led to the purification of "osteoinductive" molecules. Families of 
proteinaceous osteoinductive factors have now been purified and characterized. They have 
been variously referred to in the literamre as bone morphogenetic or morphogenic proteins 
(BMPs), osteogenic bone inductive protems or osteogenic proteins (OPs). 

3. Bone Repair and Growth Factors and Cytokines 

Following their initial purification, several bone morphogenetic protein genes have 
now been cloned using molecular techniques (Wozney et al. , 1988; Rosen et al. , 1989; 
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summarized in Alper, 1994). This work has established BMPs as members of the 
transforming growth factor-/? (TGF-|8) superfamily based on DNA sequence homologies. 
Other TGF molecules have also been shown to participate m new bone formation, and TGF- 
j8 is regarded as a complex multifunctional regulator of osteoblast function (Centrella et al. , 
1988; Carrington et al., 1988; Seitz et al., 1992). Indeed, the family of transforming growth 
factors (TGF-iSl, TGF-i82, and TGF-|83) has been proposed as potentially useful in the 
treatment of bone disease (U.S. Patent 5,125,978, incorporated herein by reference). 

Transforming growth factors (TGFs) have a central role in regulating tissue healing 
by affecting cell proliferation, gene expression, and matrix protein synthesis (Roberts and 
Spom, 1989). While not necessarily a direct effect, evidence has been provided that TGF-^1 
and TGF-i82 can initiate both chondrogenesis and osteogenesis (Joyce et al. , 1990; Izumi et 
al., 1992; Jingushi et al., 1992). In these studies new cartilage and bone formation appeared 
to be dose dependent (i.e. , dependent on the local growth factor concentration). The data 
also suggested that TGF-/31 and TGF-/32 stunulated cell differentiation by a similar 
mechanism, even though they differed in terms of the ultimate amount of new cartilage and 
bone that was formed. 

Other growth factors/hormones besides TGF and BMP may influence new bone 
formation following fracture. Bolander and colleagues injected recombinant acidic fibroblast 
growth factor into a rat fracture site (Jingushi et al. , 1990). The major effect of multiple 
high doses (1.0 mg/50 ml) was a significant increase in cartilage tissue m the fracture gap, 
while lower doses had no effect. These investigators also used the reverse transcriptase- 
polymerase chain reaction (PGR™) technique to demonstrate expression of estrogen receptor 
transcripts in callus tissue (Boden et al. , 1989). These results suggested a role for estrogen 
in normal fracture repair. 

Horowitz and colleagues have shown that activated osteoblasts will synthesize the 
cytokine, macrophage colony stimulating factor (Horowitz et al., 1989). The osteotropic 
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agents used in this study included lipopolysaccharide, PTHl-84, PTHl-34, vitamin D and 
all-trans retinoic acid. This observation has led to the suggestion that osteoblast activation 
following fracture may lead to the production of cytokines that regulate both hematopoiesis 
and new bone formation. Various other protems and polypeptides that have been found to be 
5 expressed at high levels in osteogenic cells, such as, e.g. , the polypeptide designated Vgr-1 
(Lyons et al , 1989), also have potential for use in connection with the present invention. 

4. Protein Administration and Bone Repair 

10 Several studies have been conducted in which preparations of protein growth factors, 

including BMPs, have been administered to animals in an effort to stimulate bone growth. 
The results of four such exemplary studies are described blow. 

Toriumi et al , studied the effect of recombinant BMP-2 on the repair of surgically 

m 

m created defects in the mandible of adult dogs (Toriumi et al , 1991). Twenty-six adult 

% hounds were segregated into three groups following the creation of a 3 cm full thickness 

\^ " 

W mandibular defect: 12 animals received test implants composed of inactive dog bone matrix 

III _ 

a carrier and human BMP-2, 10 animals received control implants composed of carrier without 

% BMP-2, and BMP-4 animals received no implant. The dogs were euthanized at 2.5-6 

"h20 months, and the reconstructed segments were analyzed by radiography, histology, 

rij 

Q histomorphometry, and biomechanical testing. Animals that received test implants were 
euthanized after 2,5 months because of the presence of well mineralized bone bridging the 
defect. The new bone allowed these animals to chew a solid diet, and the average bending 
strength of reconstructed mandibles was 27% of normal ('normal' in this case represents the 

25 unoperated, contralateral hemhnandible). In contrast, the implants in the other two groups 
were non-functional even after 6 months and showed minimal bone formation. 

Yasko et al , published a related study in which the effect of BMP-2 on the repair of 
segmental defects in the rat femur was exammed (Yasko et al , 1992). The study design 
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included a group that received a dose of 1.4 mg of BMP-2, another group that received 11.0 
mg of BMP-2, and a control group that received carrier matrix alone. Endochondral bone 
formation was observed in both groups of animals that received BMP-2. As demonstrated by 
radiography, histology, and whole bone (torsion) tests of mechanical integrity, the larger 
dose resulted in functional repair of the 5-nmi defect beginning 4.5 weeks after surgery. The 
lower dose resulted in radiographic and histological evidence of new bone formation, but 
functional union was not observed even a^er 9 weel:s post surgery. There was also no 
evidence of bone formation in control animals at this time. 

Chen et a/., showed that a single application of 25-100 mg of recombinant TGF-/31 
adjacent to cartilage induced endochondral bone formation in the rabbit ear full thickness skin 
wounds (Chen et al , 1991). Bone formation began 21 days following the creation of the 
wound and reached a peak at day 42, as demonstrated by morphological methods. Active 
bone remodeling was observed beyond this point. 

In a related study. Beck et al , demonstrated that a single application of TGF-^1 in a 
3% methylcellulose gel was able to repair surgically induced large skull defects that 
otherwise heal by fibrous connective tissue and never form bone (Beck et al, 1991). Bony 
closure was achieved withm 28 days of the application of 200 mg of TGF-jSl and the rate of 
healing was shown to be dose dependent. 

Studies such as those described above have thus established that exogenous growth 
factors can be used to stimulate new bone formation/repair/regeneration in vivo. Certain 
U.S. Patents also concern methods for treating bone defects or inducing bone formation. For 
example, U.S. Patent 4,877,864 relates to the administration of a therapeutic composition of 
bone inductive protein to treat cartilage and/or bone defects; U.S. Patent 5,108,753 concerns 
the use of a device containing a pure osteogenic protein to induce endochondral bone 
formation and for use in periodontal, dental or craniofacial reconstructive procedures. 
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5. Recombinant Expression 



10 



m 



The use of recombinant expression systems in the preparation of LTBP-2 and LTBP-3 
polypeptides is particularly contemplated. To express a recombinant LTBP-2 and LTBP-3 
polypeptides, whether mutant or wild-type, in accordance with the present invention one 
would prepare an expression vector that comprises an LTBP-2- or LTBP-3-encoding nucleic 
acid segment un'^er the control of one or more promoters. The "upstream" promoters 
stimulate transcription of the DNA and promote expression of the encoded recombinant 
protein. This is the meaning of "recombinant expression" in this context. 



Many standard techniques are available to construct expression vectors containing the 
appropriate nucleic acids and transcriptional/translational control sequences in order to 
achieve protein or peptide expression in a variety of host-expression systems. Cell types 
available for expression inchide, but are not limited to, bacteria, such as E, coli and 
B, subtilis transformed with recombinant bacteriophage DNA, plasmid DNA or cosmid DNA 
expression vectors. 



Certain examples of prokaryotic hosts are E, coli strain RRl, E. coli LE392, 
E. coli B, £. coli X 1776 (ATCC No. 31537) as well as £. coli W3110 (F-, lambda-, 
prototrophic, ATCC No. 273325); bacilli such as Bacillus subtilis\ and other 
enterobacteriaceae such as Salmonella typhimurium, Serratia marcescens, and various 
Pseudomonas species. 



In general, plasmid vectors containing replicon and control sequences which are 
25 derived from species compatible with the host cell are used in connection with these hosts. 
The vector ordinarily carries a replication site, as well as marking sequences which are 
capable of providing phenotypic selection in transformed cells. For example, E. coli is often 
transformed using pBR322, a plasmid derived from an E. coli species. pBR322 contains 
genes for ampicillin and tetracycline resistance and thus provides easy means for identifying 
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transformed cells. The pBR plasmid, or other microbial plasmid or phage must also contain, 
or be modified to contain, promoters which can be used by the microbial organism for 
expression of its own proteins. 

5 In addition, phage vectors containing replicon and control sequences that are 

compatible with the host microorganism can be used as transforming vectors in connection 
with these hosts. For example, the phage lambda GEM-U™ may be utilized in making a 
recombinant phage vector which can be used to transform host cells, such as E. coli LE392. 
Further useful vectors include pIN vectors (Inouye et al, 1985); and pGEX vectors, for use 
10 in generating glutathione S-transferase (GST) soluble fusion proteins for later purification and 
separation or cleavage. 

S Promoters that are most commonly used in recombinant DNA construction include the 

'=D /3-lactamase (penicillinase), lactose and tryptophan {trp) promoter systems. While these are 
H the most commonly used, other microbial promoters have been discovered and utilized, and 
kI details concerning their nucleotide sequences have been published, enabling those of skill in 
the art to ligate them functionally with plasmid vectors. 

^? 

For expression in Saccharomyces, the plasmid YRp7, for example, is commonly used 

% (Stinchcomb et al., 1979; Kingsman et al., 1979; Tschemper et al, 1980). This plasmid 

o 

0 already contains the trpL gene which provides a selection marker for a mutant strain of yeast 
lacking the ability to grow in tryptophan, for example ATCC No. 44076 or PEP4-1 (Jones, 
1977). The presence of die trpL lesion as a characteristic of the yeast host cell genome then 
provides an effective environment for detecting transformation by growth in the absence of 

25 tryptophan. 

Suitable promoting sequences in yeast vectors include the promoters for 3- 
phosphoglycerate kinase (Hitzeman et al., 1980) or other glycolytic enzymes (Hess et al., 
1968; Holland et al., 1978), such as enolase, glyceraldehyde-3-phosphate dehydrogenase, 
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hexokinase, pyravate decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 3- 
phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose 
isomerase, and glucokinase. In constructing suitable expression plasmids, the termination 
sequences associated with these genes are also ligated into the expression vector 3' of the 
sequence desked to be expressed to provide polyadenylation of the mRNA and termination. 

Other suitable promoters, which have the additional advantage of transcription 
controlled by growth conditions, include the promoter region for alcohol dehydrogenase 2, 
isocytochrome C, acid phosphatase, degradative enzymes associated with nitrogen 
metabolism, and the aforementioned glyceraldehyde-3-phosphate dehydrogenase, and 
enzymes responsible for maltose and galactose utilization. 

In addition to micro-organisms, cultures of cells derived from multicellular organisms 
may also be used as hosts. In principle, any such cell culture is workable, whether from 
vertebrate or invertebrate culture. In addition to mammalian cells, these include insect cell 
systems infected with recombinant virus expression vectors (e.g., baculovirus); and plant cell 
systems infected with recombmant virus expression vectors (e.g., cauliflower mosaic virus, 
CaMV; tobacco mosaic virus, TMV) or transformed with recombinant plasmid expression 
vectors (e.g., Ti plasmid) containing one or more LTBP-2- or LTBP-3- encoding DNA 
sequences. 

In a useful insect system. Autograph califomica nuclear polyhidrosis virus (AcNPV) 
is used as a vector to express foreign genes. The virus grows in Spodoptera frugiperda cells 
The LTBP coding sequences are cloned mto non-essential regions (for example the 
polyhedrin gene) of the vims and placed under control of an AcNPV promoter (for example 
the polyhedrin promoter). Successful insertion of the coding sequences results in the 
inactivation of the polyhedrin gene and production of non-occluded recombinant vims (i.e., 
vims lacking the proteinaceous coat coded for by the polyhedrin gene). These recombinant 
virases are then used to infect Spodoptera frugiperda cells in which the inserted gene is 
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expressed (e.g,, U.S. Patent No. 4,215,051 issued to Smith). 



Examples of usefixl mammalian host cell lines are VERO and HeLa cells, Chinese 
hamster ovary (CHO) cell lines, W138, BHK, COS-7, 293, HepG2, 3T3, RIN and MDCK 
cell Imes. In addition, a host cell strain may be chosen that modulates the expression of the 
inserted sequences, or modifies and processes the gene product m the specific fashion 
desired. Such modifications (e,g., glycosylation) and processing (e.g,, cleavage) of protein 
products may be important for the function of the protein, particularly widi respect to the 
ability of LTBP-2 and LTBP-3 polypeptides to bind to TGF)3 proteins. 

Different host cells have characteristic and specific mechanisms for the post- 
translational processing and modification of proteins. Appropriate cells lines or host systems 
can be chosen to ensure the correct modification and processing of the foreign protein 
expressed. To this end, eukaryotic host cells may also be used if desired, with a cell that 
allows for high-level expression of LTBP-2 and LTBP-3 polypeptides being preferred. 

Expression vectors for use in mammalian such cells ordinarily include an origin of 
replication (as necessary), a promoter located in front of the gene to be expressed, along with 
any necessary ribosome binding sites, RNA splice sites, polyadenylation site, and 
transcriptional terminator sequences. The origm of replication may be provided either by 
construction of the vector to include an exogenous origin, such as may be derived from SV40 
or other viral (e.^., Polyoma, Adeno, VSV, BPV) source, or may be provided by the host 
cell chromosomal replication mechanism. If the vector is integrated into the host cell 
chromosome, the latter is often sufficient. 

The promoters may be derived from the genome of mammalian cells (e.g., 
metallothionein promoter) or from mammalian viruses (e.g., the adenovirus late promoter; 
the vaccinia virus 7.5K promoter). Further, it is also possible, and may be desirable, to 
utilize promoter or control sequences normally associated with the desired LTBP-encoding 
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gene sequence, provided such control sequences are compatible with the host cell systems. 

A number of viral based expression systems may be utilized, for example, conmionly 
used promoters are derived from polyoma. Adenovirus 2, and most frequently Simian Vims 
40 (SV40). The early and late promoters of SV40 vims are particularly useful because both 
are obtained easily from the vims as a fragment which also contains the SV40 viral origin of 
replication. Smaller or larger SV40 fragments may also be used, provided there is included 
the approximately 250 bp sequence extending from the Hind III site toward the Bgll site 
located in the viral origin of replication. 

In cases where an adenovims is used as an expression vector, the coding sequences 
may be ligated to an adenovims transcription/ translation control complex, e.g., the late 
promoter and tripartite leader sequence. This chimeric gene may then be inserted in the 
adenovims genome by in vitro or in vivo recombmation. Insertion in a non-essential region 
of the viral genome (e.g., region El or E3) will result in a recombinant vims that is viable 
and capable of expressing the LTBP polypeptides in infected hosts. 

Specific initiation signals may also be required for efficient translation of LTBP-2 and 
LTBP-3 codmg sequences. These signals include the ATG initiation codon and adjacent 
sequences. Exogenous translational control signals, including the ATG initiation codon, may 
additionally need to be provided. One of ordinary skill in the art would readily be capable of 
determining this and providing the necessary signals. It is well known that the initiation 
codon must be in-frame (or in-phase) with the reading frame of the desired coding sequence 
to ensure translation of the entire insert. These exogenous translational control signals and 
initiation codons can be of a variety of origins, both natural and synthetic. The efficiency of 
expression may be enhanced by the inclusion of appropriate transcription enhancer elements, 
transcription terminators (Bittner et al, 1987). 

For long-term, high-yield production of recombinant LTBP-2 or LTBP-3 proteins, 
stable expression is preferred. For example, cell lines that stably express constmcts 
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encoding LTBP-2 or LTBP-3 polypeptides may be engineered. Rather than using expression 
vectors that contain viral origins of replication, host cells can be transformed with vectors 
controlled by appropriate expression control elements (e.g,, promoter, enhancer, sequences, 
transcription terminators, polyadenylation sites, etc.), and a selectable marker. Following 
5 the mtroduction of foreign DNA, engineered cells may be allowed to grow for 1-2 days in an 
enriched media, and then are switched to a selective media. The selectable marker m the 
recombinant plasmid confers resistance to the selet don and allows cells to stably integrate the 
plasmid into their chromosomes and grow to form foci which in turn can be cloned and 
expanded into cell lines. 



A number of selection systems may be used, including, but not limited, to the herpes 
simplex vims thymidine kinase (Wigler et al, 1977), hypoxanthine-guanine 
S phosphoribosyltransferase (Szybalska et a/., 1962) and adenine phosphoribosyltransferase 
y genes (Lowy et al, 1980), in tk-, hgprt- or aprt- cells, respectively. Also, antimetabolite 
PdS resistance can be used as the basis of selection for dhfr, that confers resistance to 
'm methotrexate (Wigler et al. , 1980; O'Hare et al. , 1981); gpt, that confers resistance to 

mycophenolic acidJMuUigan etal^, 1981); neo, that confers resistance to the aminoglycoside 
O G-418 (Colberre-Garapin et al. , 1981); and hygro, that confers resistance to hygromycin 
y! (Santerre etai, 1984). 



Site-specific mutagenesis is a technique useful in the preparation of individual 
peptides, or biologically functional equivalent proteins or peptides, through specific 
25 mutagenesis of the underlying DNA. The technique further provides a ready ability to 

prepare and test sequence variants, for example, incorporating one or more of the foregoing 
considerations, by introducing one or more nucleotide sequence changes into the DNA. Site- 
specific mutagenesis allows the production of mutants through the use of specific 
oligonucleotide sequences which encode the DNA sequence of the desired mutation, as well 
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Site-Specific Mutagenesis 
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as a sufficient number of adjacent nucleotides, to provide a primer sequence of sufficient size 
and sequence complexity to form a stable duplex on both sides of the deletion junction being 
traversed. Typically, a primer of about 17 to 25 nucleotides in length is preferred, with 
about 5 to 10 residues on both sides of the junction of the sequence being altered. 

In general, die technique of site-specific mutagenesis is well known in the art, as 
exemplified by various publications. As will be appreciated, the technique typically employs 
a phage vector which exists in both a single stranded and double stranded form. Typical 
vectors useful in site-directed mutagenesis include vectors such as the M13 phage. These 
phage are readily commercially available and their use is generally well known to those 
skilled in the art. Double stranded plasmids are also routinely employed in site directed 
mutagenesis which eliminates the step of transferring the gene of interest from a plasmid to a 
phage. 

In general, site-directed mutagenesis in accordance herewith is performed by first 
obtaining a single-stranded vector or melting apart of two strands of a double stranded vector 
which includes within its sequence a DNA sequence which encodes the desired osteotropic 
protein. An oligonucleotide primer bearing the desired mutated sequence is prepared, 
generally synthetically. This primer is flien annealed with the single-stranded vector, and 
subjected to DNA polymerizing enzymes such as E, coli polymerase I Klenow fragment, in 
order to complete the synthesis of the mutation-bearing strand. Thus, a heteroduplex is 
formed wherein one strand encodes the original non-mutated sequence and the second strand 
bears the desired mutation. This heteroduplex vector is then used to transform appropriate 
cells, such as E, coli cells, and clones are selected which include recombinant vectors 
bearing the mutated sequence arrangement. 

The preparation of sequence variants of the selected LTBP-2 or LTBP-3 gene using 
site-directed mutagenesis is provided as a means of producing potentially useful species and 
is not meant to be limiting as there are other ways in which sequence variants of osteotropic 
genes may be obtained. For example, recombinant vectors encoding the desired osteotropic 
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gene may be treated with mutagenic agents, such as hydroxylamine, to obtain sequence 
variants. 

7. In Situ Hybridization 

The following technique describes the detection of mRNA in tissue obtained from the 
site of bone regeneration. This may be useful for d:tecting expression of ^ TBP-2 or LTBP- 
3. 

DNA from a plasmid contaming the gene for which mRNA is to be detected is 
linearized, extracted, and precipitated with ethanol. Sense and antisense transcripts are 
generated from 1 mg template with T3 and T7 polymerases, e.g., in the presence of P^S] 
UTP at >6 mCi/ml (Amersham Corp., > 1200 Ci/mmol) and 1.6 U/ml RNasin (Promega), 
with the remaining in vitro transcription reagents provided in a kit (SureSite, Novagen Inc.). 
After transcription at 37°C for 1 hour, DNA templates are removed by a 15 minute digestion 
at 37 °C with 0.5 U/ml RNase-free DNase I, extracted, and precipitated with ethanol. 
Riboprobes are hydrolyzed to an average final length of 150 bp by incubating in 40 mM 
NaHCOa, 60 mM NajCOa, 80 mM DTT at 60°C, according to previously determined 
formula. Hydrolysis is terminated by addition of sodium acetate, pH 6.0, and glacial acetic 
acid to 0.09 M and 0.005% (v/v), respectively, and the probes are then ethanol precipitated, 
dissolved in 0.1 M DTT, counted, and stored at -20°C until use. 

RNase precautions are taken in all stages of slide preparation. Bouins fixed, paraffin 
embedded tissue sections are heated to 65 °C for 10 minutes, deparaffinized in 3 changes of 
xylene for 5 minutes, and rehydrated in a descending ethanol series, ending in phosphate- 
buffered saline (PBS). Slides will be soaked in 0.2 N HCl for 5 min., rinsed in PBS, 
digested with 0.0002% proteinase K in PBS for 30 minutes at 37°C and rinsed briefly with 
DEPC-treated water. After equilibrating for 3 minutes in 0.1 M triethanolamine-HCl (TEA- 
HCl), pH 8.0, sections are acetylated in 0.25% (v/v) acetic anhydride in 0.1 M TEA-HCl for 
10 minutes at room temperature, rinsed in PBS, and dehydrated in an ascending ethanol 
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series. Each section receives 100-200 ml prehybridization solution (0.5 mg/ml denatured 
RNase-free tRNA (Boehringer-Mannheim), 10 mM DTT, 5 mg/ml denatured, sulfiirylated 
salmon sperm DNA, 50% formamide, 10% dextran sulfate, 300 mM NaCl, IX RNase-free 
Denhardt's solution (made with RNase-free bovine serum albumin, Sigma), 10 mM Tris- 
5 HCl, pH 7.4, 1 mM EDTA) and then incubate on a 50°C slide warmer in a humidified 

enclosure for 2 hours. The sulfiirylated sahnon-sperm DNA blocking reagent is used in both 
prehybridization nnd hybridization solutions to help reduce nonspecific bmding to tissue by 
^^SH groups on the probe. It is prepared by labeling RNase-free sahnon sperm DNA 
(Sigma) with non-radioactive a-thio-dCTP and a-thio-dATP (Amersham) in a standard 
10 random oligonucleotide-primed DNA labeling reaction. Excess prehybridization solution is 
removed with a brief rinse in 4X SSC before application of probe. 

Riboprobes, fresh tRNA and sulfiirylated salmon sperm DNA will be denatured for 10 
^ minutes at 70°C, and chilled on ice. Hybridization sohition, identical to prehybridization 
iU.5 solution except with denatured probe added to 5 x 10* CPM/ml, is applied and slides 

incubated at SCC overnight in sealed humidified chambers on a slide warmer. Sense and 
VI antisense probes are applied to serial sections. Slides are rinsed 3 times in 4X SSC, washed 
0 with 2X SSC, 1 mM DTT for 30 min. at 50°C, digested with RNase A (20 mg/ml RNase A, 
2 0.5 M NaCl, 10 mM Tris, pH 8.0, 1 mM EDTA, pH 8.0) for 30 min. at 37°C, and rinsed 
ri£0 briefly with 2X SSC, 1 mM DTT. Three additional washes are performed, each at 50 °C for 
5 30 minutes: once in 2X SSC, 50% formamide, 1 mM DTT, and twice in IX SSC, 0.13% 

(w/v) sodium pyrophosphate, 1 mM DTT. 

Slides are dehydrated in an ascending ethanol series (with supplementation of the 
25 dilute ethanols (50% and 70%) with SSC and DTT to O.IX and 1 mM, respectively). Slides 
are exposed to X-ray fiUn for 20-60 hours to visualize overall hybridization patterns, dipped 
in autoradiographic emulsion (Kodak NTB-2, diluted to 50% with 0.3 M ammonium acetate), 
slowly dried for 2 hours, and exposed (4°C) for periods ranging from 8 days to 8 weeks. 
After developing emulsion, sections are counter strained with hematoxylin and eosin, 
30 dehydrated, and mounted with xylene-based medium. The hybridization signal is visualized 
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under darkfield microscopy. 

The above in situ hybridization protocol may be used, for example, in detecting the 
temporal and spatial pattern of PTH/PTHrP receptor expression. A suitable rat PTH/PTHrP 
5 receptor cDNA probe (R15B) is one that consists of a 1810 bp region encoding the full 
length rat bone PTH/PTHrP receptor (Abou-Samra et al, 1992). The cDNA fragment is 
subcloned into p'DNA 1 (Invitrogen Corp., San Diego, CA) and is cut out using Xbal and 
5amHI. This probe has provided positive signals for northern blot analysis of rat, murine, 
and human osteoblastic cell lines, rat primary calvarial cells, and murine bone tissue. The 
10 pcDNA I plasmid contains a T7 and SP6 promoter that facilitate the generation of cRNA 
probes for in situ hybridization. The full length transcript has been used to detect 
PTH/PTHrP receptor in sections of bone (Lee et al. , 1994). The PTHrP cDNA probe 
% (Yasuda et al. , 1989) is a 400 bp subcloned fragment in pBluescript (Stratagene). This probe 
^2 has been used for in situ hybridization, generating an antisense cRNA probe using Bamttl 
MS cleavage and the T3 pruner and a sense cRNA probe using EcoKl cleavage and the T7 
prmier. 

Ul 

Q 8. Monoclonal Antibody Generation 

TT'i 
fc? ■ 

rid: 

%Q Means for preparing and characterizing antibodies are well known in the art (See, 

p e,g,. Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988; mcorporated 

herein by reference). 

The methods for generating monoclonal antibodies (MAbs) generally begin along the 
25 same lines as those for preparing 'polyclonal antibodies. Briefly, a polyclonal antibody is 

prepared by immunizing an animal with an immunogenic composition in accordance with the 
present invention and collecting antisera from that inmiunized animal. A wide range of 
animal species can be used for the production of antisera. Typically the animal used for 
production of anti-antisera is a rabbit, a mouse, a rat, a hamster, a guinea pig or a goat. 
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Because of the relatively large blood volume of rabbits, a rabbit is a preferred choice for 
production of polyclonal antibodies. 

As is well known in the art, a given composition may vary in its immunogenicity. It 
is often necessary therefore to boost die host immune system, as may be achieved by 
coupling a peptide or polypeptide inmiunogen to a carrier. Exemplary and preferred carriers 
are keyhole limpet hemocyanin (KLH) and bovine serum albumin (BSA). Other albumins 
such as ovalbumin, mouse serum albumin or rabbit serum albumm can also be used as 
carriers. Means for conjugating a polypeptide to a carrier protein are well known in the art 
and include glutaraldehyde, m-maleimidobencoyl-N-hydroxysuccmimide ester, carbodiimyde 
and bis-biazotized benzidine. 

As is also well known in the art, the immunogenicity of a particular unmunogen 
composition can be enhanced by the use of non-specific stimulators of the immune response, 
known as adjuvants. Exemplary and preferred adjuvants include complete Freund's adjuvant 
(a non-specific stimulator of the immune response containing killed Mycobacterium 
tuberculosis), incomplete Freund's adjuvants and aluminum hydroxide adjuvant. 

The amount of immunogen composition used in the production of polyclonal 
antibodies varies upon the nature of the immunogen as well as the animal used for 
immunization. A variety of routes can be used to administer the immunogen (subcutaneous, 
intramuscular, intradermal, intravenous and intraperitoneal). The production of polyclonal 
antibodies may be monitored by sampling blood of the immunized animal at various points 
following immunization. A second, booster injection, may also be given. The process of 
boosting and titering is repeated until a suitable titer is achieved. When a desired level of 
immunogenicity is obtained, the immunized animal can be bled and the serum isolated and 
stored, and/or the animal can be used to generate MAbs. 

MAbs may be readily prepared through use of well-known techniques, such as those 
exemplified m U.S. Patent 4,196,265, incorporated herein by reference. Typically, this 
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technique involves immunizing a suitable animal with a selected immunogen composition, 
e.g., a purified or partially purified LTBP-2 or LTBP-3 protein, polypeptide or peptide. The 
immunizing composition is administered in a manner effective to stimulate antibody 
producing cells. Rodents such as mice and rats are preferred animals, however, the use of 
rabbit, sheep frog ceUs is also possible. The use of rats may provide certain advantages 
(Coding, 1986, pp. 60-61), but mice are preferred, with the BALB/c mouse being most 
preferred as this is most routinely used and generally gives a higher percentage of stable 
fusions. 

Following immunization, somatic cells with the potential for producing antibodies, 
specifically B lymphocytes (B cells), are selected for use in the MAb generating protocol. 
These cells may be obtained from biopsied spleens, tonsils or lymph nodes, or from a 
peripheral blood sample. Spleen ceUs and peripheral blood cells are preferred, the fonner 
because they are a rich source of antibody-producing cells that are in the dividing plasmablast 
stage, and the latter because peripheral blood is easily accessible. Often, a panel of animals 
will have been immunized and the spleen of animal with the highest antibody titer will be 
removed and the spleen lymphocytes obtained by homogenizing the spleen with a syringe. 
Typically, a spleen from an inmiunized mouse contains approximately 5 X 10^ to 2 X 10^ 
lymphocytes. 

The antibody-producing B lymphocytes from the immunized animal are then fused 
with cells of an immortal myeloma cell, generally one of the same species as the animal that 
was immunized. Myeloma cell lines suited for use in hybridoma-producing fusion 
procedures preferably are non-antibody-producing, have high fusion efficiency, and enzyme 
deficiencies that render then incapable of growing in certain selective media which support 
the growth of only the desired fused cells (hybridomas). 

Any one of a number of myeloma cells may be used, as are known to those of skill in 
the art (Coding, pp. 65-66, 1986; Campbell, pp. 75-83, 1984). For example, where the 
immunized animal is a mouse, one may use P3-X63/Ag8, X63-Ag8.653, NSl/l.Ag 4 1, 
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Sp210-Agl4, FO, NSO/U, MPC-11, MPC11-X45-GTG 1.7 and S194/5XX0 Bui; for rats, 
one may use R210.RCY3, Y3-Ag 1.2.3, IR983F and 4B210; and U-266, GM1500-GRG2, 
LICR-L0N-HMy2 and UC729-6 are all useful in connection with human cell fusions. 

One preferred murine myeloma cell is the NS-1 myeloma cell line (also termed P3- 
NS-l-Ag4-l), which is readily available from the NIGMS Human Genetic Mutant Cell 
Repository by requestmg cell line repository number GM3573. Another mouse myeloma cell 
line that may be used is the 8-azaguanine-resistant mouse murine myeloma SP2/0 
non-producer cell line. 

Methods for generating hybrids of antibody-producing spleen or lymph node cells and 
myeloma cells usually comprise mixing somatic cells with myeloma cells in a 2:1 proportion, 
though the proportion may vary from about 20:1 to about 1:1, respectively, in the presence 
of an agent or agents (chemical or electrical) that promote the fusion of cell membranes. 
Fusion methods using Sendai virus have been described by Kohler and Milstein (1975; 
1976), and those using polyethylene glycol (PEG), such as 37% (v/v) PEG, by Gefter et al 
(1977). The use of electrically mduced fusion methods is also appropriate (Goding pp. 
71-74, 1986). 

Fusion procedures usually produce viable hybrids at low frequencies, about 1 x 10*^ 
to 1 X 10"^ However, this does not pose a problem, as the viable, fused hybrids are 
differentiated from the parental, unfiised cells (particularly the unfiised myeloma cells that 
would normally continue to divide indefinitely) by culturing in a selective medium. The 
selective medium is generally one that contains an agent that blocks the de novo synthesis of 
nucleotides in the tissue culture media. Exemplary and preferred agents are aminopterin, 
methotrexate, and azaserine. Aminopterin and methotrexate block de novo synthesis of both 
purines and pyrimidines, whereas azaserine blocks only purine synthesis. Where aminopterin 
or methotrexate is used, the media is supplemented with hypoxanthine and thymidine as a 
source of nucleotides (HAT medium). Where azaserine is used, the media is supplemented 
with hypoxanthine. 
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The preferred selection medium is HAT. Only cells capable of operating nucleotide 
salvage pathways are able to survive in HAT medium. The myeloma cells are defective in 
key enzymes of the salvage pathway, e.g., hypoxanthine phosphoribosyl transferase (HPRT), 
and they cannot survive. The B cells can operate this pathway, but they have a limited life 
5 span in culture and generally die within about two weeks. Therefore, the only cells that can 
survive in the selective media are those hybrids formed from myeloma and B cells. 

This culturing provides a population of hybridomas from which specific hybridomas 
are selected. Typically, selection of hybridomas is performed by culturing the cells by 
10 single-clone dilution in microtiter plates, followed by testing the individual clonal 

supematants (after about two to three weeks) for the desired reactivity. The assay should be 
sensitive, simple and rapid, such as radioimmunoassays, enzyme immunoassays, cytotoxicity 
5 assays, plaque assays, dot immunobinding assays, and the like. 
If! 

nis The selected hybridomas would then be serially diluted and cloned into individual 

5 antibody-producing cell lines, which clones can then be propagated indefinitely to provide 
m MAbs. The cell lines may be exploited for MAb production in two basic ways. A sample 
% of the hybridoma can be injected (often into the peritoneal cavity) into a histocompatible 
0^ animal of the type that was used to provide the somatic and myeloma cells for the original 
ri|0 fusion. The injected animal develops tumors secreting the specific monoclonal antibody 
y produced by the fused cell hybrid. The body fluids of the animal, such as serum or ascites 
fluid, can then be tapped to provide MAbs in high concentration. The individual cell lines 
could also be cultured in vitro, where the MAbs are naturally secreted into the culture 
medium from which they can be readily obtained in high concentrations. MAbs produced by 
25 either means may be further purified, if desired, using filtration, centrifugation and various 
chromatographic methods such as HPLC or affinity chromatography. 

9. LTBP Structure and Function 

30 The LTBPs appear to be modular polypeptides characterized by the presence of 
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multiple cysteine-rich motifs. Molecular cloning of human LTBP-1 (Kanzaki et al, 1990) 
indicates, for example, that the molecule consists of 16 epidermal growth factor-like repeats 
with the potential to bind calcium (EGF-CB repeats), 3 copies of a unique motif containing 8 
cysteine residues, an RGD motif, and an 8 amino acid motif identical to the cell binding 
domain of the laminin j82 chain. EGF-CB repeats may be modified to contain 
hydroxyaspartic acid and hydroxyasparagine (Stenflo et al. , 1987). The genes that code for 
two LTBPs have been isolated previously. In sharp contirast to those of the prior art, the 
LTBP-3 sequence of the present invention shares only 40% sequence identity with those of 
the LTBP-1 sequence (Kanzaki et al, 1990) and the LTBP-2 sequence (Moren et al, 1994. 

Unlike the human LTBP genes isolated previously which are localized to human 
chromosome 2, band pl2-q22 (LTBP-1, Stenman et al, 1994) and human chromosome 14, 
band q24 (LTBP-2, Moren et al , 1994), the LTBP-3 of the present invention is localized to 
chromosome 11, band ql2. 

One aspect of the present invention is the mapping of the murine LTBP-3 to mouse 
chromosome 19, band B, a region of conserved synteny with human chromosome 11, band 
ql2. 

While the function of LTBP is unknown (i.e. , studies with transfected CHO cells 
indicate that LTBP does not contribute to TGF-^ latency), several ideas have been proposed 
that, when taken together, suggest that LTBP may function as a extiracellular stiiictural 
protein capable of both regulating and targeting TGF-)3 activity. First, LTBP may regulate 
the inti^cellular biosynthesis of latent TGF-jS. Cultured erythroleukemia cells efficientiy 
assemble and secrete large latent TGF-|8 complexes, but they poorly secrete latent TGF-/3 
complexes that lack LTBP-1 (Miyazono et al, 1991; 1992). The retained complexes contain 
anomalous disulfide bonds, suggesting that, for erytiiroleukemia cells at least, LTBP 
contributes to the normal assembly and secretion of TGF-/3 latent complexes. Second, LTBP 
may function to target latent TGF-/3 to specific types of connective tissue. Recent evidence 
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suggests that the large latent TGF-|S complex covalenay binds the extracellular matrix via 
LTBP-1, i.e., LTBP-1 may target the latent TGF-jS complex to a site near the cell surface 
(pericellular matrix) to facUitate the generation of autocrine or paracrine effects (Flaumenhaft 
et ai, 1993; Taipale et al, 1994; Taipale and Keski-Oja, 1992). Third, LTBP may 
modulate the activation of latent complexes. There is direct evidence that LTBP-1 binds 
calcium (Colosetti et al, 1993) and that calcium binding induces a structural change that 
protects LTBP-1 from proteolytic attack. As described above (Lyons et al, 1988; Taipale et 
al, 1995), latent TGF-j8 is exposed to protease-rich environments during wound repair and 
normal development and exposure to these environments leads to the release of mature TGF- 
/3 fi:om extracellular storage sites. It is therefore possible that the protease-resistant 
conformation of LTBP helps ensure TGF-/3 integrity in vivo. 

10. LTBP and Skeletal Tissues 

Skeletal tissue represents one of the largest known repositories of latent TGF-/3 (200 
/ig/kg bone; Seyedin et al, 1986; Seyedin et al, 1987). Moreover, activated TGF-^ may 
stimulate bone formation in developing tissues and may act as a "coupling factor" that 
coordinates matrix resorption and formation during bone remodeling (Centrella et al, 1991). 
Finally, activated TGF-jS may exert a powerful osteoinductive stimulus following fracture 
(Joyce et al , 1990; Beck et al , 1993). 

It has not yet been determmed if TGF-/3 normally exists in bone as a small or large 
latent complex. Previous studies (Pfeilschifter et al , 1990; Bonewald et al , 1991) have 
shown that the major form of TGF-/3 in conditioned media from bone organ cultures is a 100 
kDa latent complex that lacks LTBP-1. Additionally, Dallas et al (Dallas et al , 1994) have 
demonstrated that cultured MG63, ROS 17/2.8, and UMR-106 cells (derived from 
osteosarcomas of various types) each secrete two major forms of latent TGF-/31, namely, a 
290 kDa complex that contains LTBP-1 and a 100 kDa complex lacking LTBP-1. A second 
high molecular weight complex that contained latent TGF-|82 and LTBP was also identified. 
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The presence and relative amount of low and high molecular weight complexes varied with 
cell type, and TGF-jSl and LTBP did not appear to be co-expressed in bone cells. These 
results led the authors to conclude that the 100 kDa latent TGF-/3 complex is a 
physiologically important form in bone cells, i.e., LTBP did not appear to be required for 
5 the proper and efficient assembly and secretion of small latent complexes. 

At the end of their paper, Dallas et al. also acknowledged that the production of small 
versus large latent TGF-jS complexes may be associated with specific stages in the maturation 
of bone cells — e.g., MG63, ROS 17/2.8, and UMR-106 cells are known to express 
10 different subsets of mature osteoblast phenotypic markers, which could explain differences m 
the size of the latent TGF-|8 complexes produced by the various osteosarcoma cell lines. 
Along this line, the inventors' laboratory has recently shown that mouse pre-osteoblast 
O MC3T3-E1 cells express the ltbp-3 gene at the outset of osteoblast differentiation and that the 
Ul LTBP-3 polypeptide binds TGF-j81 in MC3T3-E1 conditioned media (Yin et al , 1995). 
%5 Therefore, MC3T3-E1 cells express large latent TGF-jSl complexes bearing LTBP-3 
5 precisely at the time of the pre-osteoblast to osteoblast transition {i.e. , at -day 14 in culture, 

£0 

ITI or at the onset of alkaline phos phata se expression). 

^ Extending the MC3T3-E1 data further, the mventors' laboratory has also found that 

[go ltbp-2 and ltbp-3 are co-expressed with TGF-i3 in developing mouse skeletal tissues (Yin et 
y al. , 1995) and that ltbp-3 (at least) is expressed at sites of osteotomy repair in vivo (Yin et 
al, 1994). 

There is little doubt that TGF-(3 contributes to the normal structure and function of 
25 skeletal tissues. Bone is an abundant source of latent TGF-/3, and mature TGF-^ contributes 
to the processes of skeletal morphogenesis, bone remodeling, and bone repair. 
Consequently, bone must regulate the autocrine and paracrine effects of TGF-/3 with 
precision. The mventors' laboratory was the fnst to clone and map the mouse ltbp-2 and 
ltbp-3 genes, and it has obtained evidence that both genes are expressed during normal 
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murine skeletal morphogenesis and during bone osteotomy repair. In addition, in MC3T3-E1 
pre-osteoblasts TGF-/3 is synthesized as a homodimer known as the small latent complex that 
covalentiy binds LTBP to form large latent complexes. These results suggest for the first 
time that LTBP facilitates the assembly and secretion of latent TGF-^ complexes and targets 
latent TGF-jS to bone matrix. With the availability of the ltbp-2 and ltbp-3 genes, the 
opportunity exists in the mventors' laboratory to gain further insight into LTBP structure and 
function and, in turn, the mechanism by which lat;nt TGF-/3 complexes can be targeted to 
bone matrix and cells in a controlled manner. 




10 ^^^^ 11. Detection of LTBP-encoding DNA Segments 



The amount of an LTBP-2 or LTBP-3-encoding DNA segment present within a 
9 biological sample, such as blood, serum or PBMC sample, may be determined by means of a 
'Ji molecular biological assay to determine the level of a nucleic acid that encodes such an 
^5 LTBP-2 or LTBP-3 polypeptide, or by means of an immunoassay to determine the level of 
the polypeptide itself. 

|fi ^ ^ 

In a molecular biological method for detecting a cell that produces LTBP-2 or LTBP- 
ff"^ 3, one would obtain nucleic acids from one or more cells and analyze the nucleic acids to 
So identify a nucleic acid segment that encodes LTBP-2 or LTBP-3. Such nucleic acids may be 
identified by length, where an appropriate assay would be a PCR™-based assay resulting in 
the identification of an LTBP-2 or LTBP-3 -encoding mRNA transcript. Alternatively, the 
nucleic acid segment may be identified by sequence, which method generally includes 
identifying a transcript with a sequence of the present invention e.g., by Northern or 
25 Southern blotting using a discriminating probe prepared in accordance with SEQ ID NO: 1 or 
SEQ ID N0:3. 



The detection of a cell that produces LTBP-2 or LTBP-3-encoding DNA segment 
using a method based upon the sequence of an ltbp-2 or ltbp-3 transcript requires an ltbp-2 or 
30 ltbp-3 probe with a novel DNA sequence as disclosed herein. This imparts an evident utility 
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to the nucleic acid segments of the present invention, particularly the shorter 



The presence of a substantially complementary nucleic acid sequence in a sample, or 
a significantly increased level of such a sequence in comparison to the levels in a normal or 
"control" sample, wiU thus be indicative of a sample that contains a cell that harbors an 
LTBP-2 or LTBP-3-encoding DNA segment. Here, substantially complementary LTBP-2 or 
LTBP-3-encoding nucleic acid sequences are those that have relatively little sequence 
divergence and that are capable of hybridizing under relatively stringent conditions, as 
discussed above. 

A variety of hybridization techniques and systems are known that can be used in 
connection with the detection aspects of the invention, including diagnostic assays such as 
those described in Falkow et al. , U.S. Patent 4,358,535. Short coding or non-coding nucleic 
acid segment probes may also be employed as primers in connection with diagnostic PGR™ 
technology, as well as for use in any of a number of other PGR™ applications, including 
PGR™-based cloning and engineering protocols. 

In general, the "detection" of an LTBP-2 or LTBP-3-encoding DNA segment is 
accomplished by attaching or incorporating a detectable label into the nucleic acid segment 
used as a probe and "contacting" a sample with the labeled probe. In such processes, an 
effective amount of a nucleic acid segment that comprises a detectable label (a probe), is 
brought into direct juxtaposition with a composition containing target nucleic acids. 
Hybridized nucleic acid complexes may then be identified by detecting the presence of the 
label, for example, by detecting a radio, enzymatic, fluorescent, or even chemiluminescent 
label. 

Many suitable variations of hybridization technology are available for use in the 
detection of nucleic acids, as will be known to those of skill in the art. These include, for 
example, in situ hybridization. Southern blotting and Northern blotting. In situ hybridization 
describes the techniques wherein the target nucleic acids contacted with the probe sequences 
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are those located within one or more cells, such as cells within a clinical sample or even 
cells grown in tissue culmre. As is well known in the art, the cells are prepared for 
hybridization by fixation, e.g.. chemical fixation, and placed in conditions that allow for tiie 
hybridization of a detectable probe witii nucleic acids located within the fixed cell. 

5 

Alternatively, target nucleic acids may be separated from a cell or sample prior to 
contact witii a probe. Any of the wide variety of methods for isolating target nucleic acids 
may be employed, such as cesium chloride gradient centrifugation, chromatography (e.g., 
ion, affinity, magnetic), phenol extraction and the like. Most often, the isolated nucleic acids 
10 will be separated, e.g., by size, using electirophoretic separation, followed by immobUization 
onto a solid matiix, prior to contact wifli die labelled probe. These prior separation 
techniques are frequently employed in die art and are generally encompassed by the terms 
"Soutiiem blotting" and "Northern blotting". Altiiough tiie execution of various techniques 
using labeled probes to detect LTBP-2 or LTBP-3-encoding DNA or RNA sequences in 
fls clinical samples are well known to tiiose of skUl in the art, a particularly preferred metiiod is 

it; 

described in detail herem, in Examples 1 and 3. 

Kits for use in Southern and Northern blotting to identify LTBP-2 or LTBP-3- 
encoding DNA segments are also contemplated to fall witiiin the scope of tiie present 
invention. Such kits will generally comprise, in suitable container means, ltbp-2 or ltbp-3 



in 



y 5 



-50- 



nucleic acid probes; unrelated probes for use as controls; and optionally, one or more 
restriction enzymes. 

12. Construction of Chimeric Promoter-Reporter Expression Plasmids 

5 

Using the 5' upstream flanlcing sequence of the LTBP-2 and LTBP-3 genes, a series 
of restriction fragments from available phage genomic inserts may be generated and 
subcloned into a promoter-reporter expression plasmid. An example of such a vector is 
pGL3, a luciferase reporter vector (Promega), which has a strong 5' terminator of 
10 transcription, a multiple cloning site, the cDNA coding sequence of insect luciferase, a 

strong translation stop codon, and an intron/polyadenylation signal sequence derived from 
SV40. 

[f j By assembling 9-15 overlapping promoter-reporter expression plasmids that cover an 

lis about 5 kb region of interest, the identification of potential m-acting elements is 

ff^ contemplated. Once the initial constructs have been characterized, regions of 5' upstream 

01 

IJl flanking sequence that show strong promotion (or repression) of gene expression may be 

L studied in careful detail. 

ff\ 

5=|0 The Northern analysis and tissue in situ hybridization data presented herein suggest 

S! that ltbp-3 is highly expressed m developing mouse tissues. Additionally, die Itbp-S 

transcript appears to be highly expressed by MC3T3-E1 cells in culture. The structural 
features of this sequence are consistent with a so-called housekeeping gene, i.e., a TATA- 
less and CAATT-less promoter sequence that is 70% GC-rich. 

25 

To overcome potential problems of low reporter gene expression, the inventors 
contemplate the use of pEU-CAT, a promoter-less CAT vector especially constructed for the 
analysis of weak promoters (Harduin-Lepers et al , 1993). Alternatively, modifying the 
reporter constructs by the addition of an Itbp-S enhancer element is also contemplated to be 
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useful. 

13. Evaluating Promoter Function 

5 Promoter function may be evaluated by in vitro transfection studies using L cells or 

NIH3T3 cells, since they have been used successfiiUy for this purpose. An interesting 
alternative is to use Drosophila SH cells which: lack the trans-acting factor Spl; co- 
transfection of SL2 cells with a plasmid encoding the Spl protein such as pP^cSpl and a 
reporter construct containing putative Spl binding sites will result in reporter expression if 
10 the sites are functional (Courey and Tjian, 1988). L cell transfection is performed using 

standard protocols (Sambrook et al, 1989). Briefly, subconfluent cells (covering -20% of 
a 100 mm plastic tissue culture dish) are washed 2x in DMEM tissue culmre medium 
O (GIBCO) and then incubated for 3 hrs. at 37°C in a sterile mixmre of DEAE-dextran (0.25 
5 mg/ml), chloroquine (55 mg/ml), and 15 /tg plasmid DNA (Courey and Tjian, 1988). Cells 
fls are then shocked by incubation with 10% DMSO in sterile PBS for 2 min. at 37°C, washed 

CP 2x with DMEM (Sambrook et al , 1989), and incubated in DMEM plus 10% fetal calf serum 

ffl 

m and antibiotics forJ72 hr. at 37°C^ Since DEAE-dextran is quite toxic to NIH3T3 cells, 
these cells may be transfected by electroporation.] 

^ Using the pGL3 reporter/expression vector, luciferase activity may be measured in 

2 cell lysates using Luciferase Assay System kit reagents (Promega) according to protocols 

provided by the manufacturer. To measure background CPM, 20 ^1 of cell lysate is added to 
a clean microcentrifuge tube and light activity is measured in a scintillation counter (System 
1400 scintillation counter, Wallac Nuclear; all channels open). The same procedure is used 
25 to measure background CPM of 100 /xl luciferase substrate stock solution. Once the 

background CPM have been documented, homogenate and substrate are mixed and light 
emission is measured immediately. Enzyme activity values are routinely obtained in 
triplicate, normalized to 1 mg of total protem, and expressed as a mean value ± the standard 
deviation. Student's t test is used to determine statistical significance of differences among 
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groups (95% confidence level). 

In one study, following transfection of COS cells with the pGL3 luciferase expression 
plasmid, an aliquot of a cell homogenate was assayed for enzyme activity using commercially 
5 available kits and according to the manufacturer's recommendations. An equal aliquot of a 
cell homogenate prepared from untransfected COS served as a negative control. Significant 
luciferase activity was found only in the homogenat; from transfected cells. 

14. Biological Functional Equivalents 

10 

As mentioned above, modification and changes may be made in the structure of an 
osteotropic gene and still obtain a functional molecule that encodes a protein or polypeptide 
O with desirable characteristics. The following is a discussion based upon changing the amino 
5 acids of a protein to create an equivalent, or even an improved, second-generation molecule. 
%5 The amino acid changes may be achieved by changing the codons of the DNA sequence, 
ff= according to the following codon table: 

in _ _ . 

m 

c 'sir 
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Table 1 



Codons 



10 



4=1 



lib 

£ 



30 



Alanine 


Ala 


A 


GCA 


GCC 


GCG 


GCU 






Cysteine 


Cys 


C 


UGC 


UGU 










Aspartic acid 


Asp 


D 


GAC 


GAU 










Glutamic acid 


Glu 


E 


GAA 


GAG 










Phenylalanine 


Phe 


F 


UUC 


UUU 










Glycine 


Gly 


G 


GGA 


GGC 


GGG 


GGU 






Histidine 


His 


H 


CAC 


CAU 










Isoleucine 


He 


I 


AUA 


AUC 


AUU 








Lysine 


Lys 


K 


AAA 


AAG 










Leucine 


Leu 


L 


UUA 


UUG 


CUA 


cue 


CUG 


CUU 


Methionine 


Met 


M 


AUG 












Asparagine 


Asn 


N 


AAC 


AAU 










Proline 


Pro 


P 


CCA 


CCC 


CCG 


ecu 






Glutamine 


Gin 


Q 


CAA 


CAG 










Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


CGG 


CGU 


Serine 


Ser 


S 


AGC 


AGU 


UCA 


ucc 


UCG 


ucu 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACU 






Valine _ 


Val _ 


_ V 


GUA 


GUC 


GUG 


GUU 






Tryptophan 


Trp 


w 


UGG 












Tyrosine 


Tyr 


Y 


UAC 


UAU 











For example, certain amino acids may be substituted for other amino acids in a 
protein structure without appreciable loss of interactive binding capacity with structures such 
as, for example, antigen-binding regions of antibodies or binding sites on substrate 
molecules. Since it is the interactive capacity and nature of a protein that defines that 
protein's biological functional activity, certain amino acid sequence substitutions can be made 
in a protein sequence, and, of course, its underlying DNA coding sequence, and nevertheless 
obtain a protein with like properties. It is thus contemplated by the inventors that various 
changes may be made in the DNA sequences of osteotropic genes without appreciable loss of 
their biological utility or activity. 
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In making such changes, the hydropathic index of amino acids may be considered. 
The importance of the hydropathic amino acid index in conferring interactive biologic 
function on a protein is generally understood in the art (Kyte and Doolittle, 1982, incorporate 
herein by reference). It is accepted that the relative hydropafliic character of the amino acid 
contributes to the secondary structure of the resultant protein, which in turn defines the 
interaction of the protein with other molecules, for example, enzymes, substrates, receptors, 
DNA, artibodies, antigens, -^nd the like. 

Each amino acid has been assigned a hydropathic index on the basis of their 
hydrophobicity and charge characteristics (Kyte and Doolittle, 1982), these are: Isoleucine 
(+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine (+2.5); 
methionme (+1.9); alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); tryptophan 
(-0.9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); glutamine (-3.5); 
aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (-4.5). 

It is known in the art that certain amino acids may be substituted by other amino acids 
having a similar hydropathic index or score and still result in a protein with similar 
biological activity, i.e., still obtain a biological functionally equivalent protein. In making 
such changes, the substitution of amino acids whose hydropathic indices are within ±2 is 
preferred, ±ose which are within ±1 are particularly preferred, and those within ±0.5 are 
even more particularly preferred. 

It is also understood in the art that the substitution of like amino acids can be made 
effectively on the basis of hydrophilicity. U.S. Patent 4,554,101, incorporated herein by 
reference, states that the greatest local average hydrophilicity of a protein, as governed by 
the hydrophilicity of its adjacent amino acids, correlates with a biological property of the 
protein. 

As detailed in U.S. Patent 4,554,101, the following hydrophilicity values have been 
assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0 ± 1); 
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glutamate (+3.0 ± 1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); 
threonine (-0.4); proline (-0.5 ± 1); alanine (-0.5); histidine *-0.5); cysteine (-1.0); 
methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (-2.3); 
phenylalanine (-2.5); tryptophan (-3.4). 

5 

It is understood that an amino acid can be substituted for another having a similar 
hydrcphilicity v. lue and stUl obtain a biologically equivalent, and in particular, an 
immunologically equivalent protem. In such changes, the substitution of amino acids whose 
hydrcphilicity values are within ±2 is preferred, those which are within +1 are particularly 
10 preferred, and those within ±0.5 are even more particularly preferred. 

As outlined above, amino acid substitutions are generally therefore based on the 
O relative similarity of the amino acid side-chain substituents, for example, their 
m hydrophobicity, hydrophilicity, charge, size, and the like. Exemplary substitutions which 
II5 take various of the foregoing characteristics mto consideration are well known to those of 
5 skill in the art and include: arginine and lysine; glutamate and aspartate; serine and 
threonine; glutamine and asparagine; and valine, leucine and isoleucme. 



mo 



5 The following examples are included to demonstrate preferred embodiments of the 

invention. It should be appreciated by those of skill in the art that the techniques disclosed 
in the examples which follow represent techniques discovered by the inventors to function 
well in the practice of the invention, and thus can be considered to constimte preferred 
25 modes for its practice. However, those of skill in the art should, in light of the present 

disclosure, appreciate that many changes can be made in the specific embodiments which are 
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disclosed and still obtain a like or simUar result without departing from the spirit and scope 
of the invention. 

EXAMPLE I 

ISOLATION OF A NOVEL LATENT TGF-/3 
BINDING PROTETN-LIKE LTBP-3 GENE 

The TGF-jSs represent a family of structurally related molecules with diverse effects 
on mammalian cell shape, growth, and differentiation (Roberts and Spom, 1990). Initially 
synthesized as a precursor consisting of an amino-terminal propeptide followed by mature 
TGF-/3, two chains of nascent pro-TGF-j8 associate in most tissues to form a Mr ~ 106,000 
inactive disulfide-bonded dimer. Homodimers are most common, but heterodimers have also 
been described (Cheifetz et al, 1987; Ogawa et al, 1992). During biosynthesis the mature 
TGF-(3 dimer is cleaved from the propeptide dimer. TGF-jS latency results in part from the 
non-covalent association of propeptide and mature TGF-i3 duners (Pircher et al, 1984 and 
1986; Wakefield et al, 1987; Millan et al, 1992; see also Miyazono and Heldin, 1989). 
Consequently, the propeptide dimer is often referred to as the latency associated protein 
(LAP), and LAP plus the disulfiae-bonded TGF-j8 dimer are also known as the small latent 
complex. In the extracellular space small latent complexes must be dissociated to activate 
mamre TGF-/3. The mechanism of activation of the latent complex is thought to be one of 
the most important steps governing TGF-jS effects (Lyons et al , 1988; Antonelli-Orlidge et 
al, 1989; Twardzik et al, 1990; Sato et al, 1993). 

In certain lines of cultured cells small latent growth factor complexes may contam 
additional high molecular weight proteins. The best characterized of these high molecular 
weight protems is the latent TGF-|3 binding protein, or LTBP (Miyazono et al , 1988; 
Kanzaki et al, 1990; Tsuji et al, 1990; Olofsson et al, 1992; Taketazu et al, 1994). 
LTBP produced by different cell types is heterogeneous in size, perhaps because of 
alternative splicing or because of tissue-specific proteolytic processing (Miyazono et al, 
1988; Wakefield et al, 1988; Kanzaki et al, 1990; Tsuji et al, 1990). Latent TGF-/3 
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complexes that contain LTBP are known as large latent complexes. LTBP has no known 
covalent linkage to mature TGF-j3, but rather it is linked by a disulfide bond to LAP. 

Two LTBPs have been isolated to date. The deduced human LTBP-1 amino acid 
5 sequence is comprised of a signal peptide, 16 epidermal growth factor-like repeats with the 
potential to bind calcium (EGF-CB repeats), 2 copies of a unique motif containing 8 cysteine 
residues, an RGD cell attachment motif, and an 8 amino acid motif identical to the cell 
binding domain of the laminin B2 chain (Kanzaki et al, 1990). There is evidence that 
LTBP-1 binds calcium, which, in turn, mduces a structural change that protects LTBP from 
10 proteolytic attack (Colosetti et al, 1993). LTBP-2 shows 41% sequence identity to LTBP-1, 
and its structural domains show a similar overall organization (Moren et al, 1994). 

While the functions of LTBP-1 and LTBP-2 presently are unknown, several ideas 
D have been put forward in the literature. First, LTBP may regulate the intracellular 

m biosynthesis of latent TGF-jS precursors. Cultured erythroleukemia cells efficiently assemble 

^15 and secrete large latent TGF-jS complexes, whereas they slowly secrete small latent TGF-jS 
5 complexes that contain anomalous disulfide bonds (Miyazono et al, 1991; 1992). Therefore, 

iji LTBP may facilitate the normal assembly and secretion of latent TGF-jS complexes. Second, 

LTBP may target latent TGF-)3 to specific types of connective tissue. Recent evidence 
P suggests that the large latent TGF-jS complex is covalently bound to the extracellular matrix 

rtj20 via LTBP (Taipale et al, 1994). Based on these observations, LTBP has been referred to as 
^ a "matrix receptor", /.e. a secreted protein that targets and stores latent growth factors such 

as TGF-jS to the extracellular matrix. Third, LTBP may modulate the activation of latent 
complexes. This idea is based in part on recent evidence which suggests that mature TGF-jS 
is released from extracellular storage sites by proteases such as plasmin and thrombin and 
25 that LTBP may protect small latent complexes from proteolytic attack (Falcone et al, 1993; 
Benezra et al, 1993; Taipale et al, 1994), i,e, protease activity may govern the effect of 
TGF-jS in tissues, but LTBP may modulate this activity. Fourth, LTBP may plays an 
important role in targeting the latent TGF-jS complex to the cell surface, allowing latent 
TGF-jS to be efficiently activated (Flaumenhaft et al, 1993). 
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A. MATERIALS AND METHODS 

1. cDNA Cloning 

Aliquots (typically 40-50,000 PFU) of phage particles from a cDNA library in the 
XZAPII® vector made from NIH 3T3 cell mRNA (Stratagene) and fresh overnight 
XLl-Blue™ cells (grown in Luria broth supplemented with 0.4% maltose in 10 mM MgS04) 
were mixed, incubated for 15 min. at 37°C, mixed again with 9 ml of liquid (50°C) top 
layer agarose (NZY broth plus 0.75% agarose), and then spread evenly onto freshly poured 
150 mm NZY-agar plates. Standard methods were used for the preparation of plaque-lifts and 
filter hybridization (42°C, in buffer containing 50% formamide, 5X SSPE, IX Denhardt's, 
0.1% SDS, 100 mg/ml saUnon sperm DNA, 100 mg/ml heparin). Filters were washed 
progressively to high stringency (O.IX SSC/0.1% SDS, 65°C). cDNA probes were 
radiolabeled by the nick translation method using commercially available reagents and 
protocols (Nick Translation Kit, Boehringer Mannheim). Purified phage clones were 
converted to pBluescript® plasmid clones, which were sequenced usmg Sequenase (v2.0) as 
described (Chen et al, 1993; Yin et al, 1995). Sequence alignment and identity was 
determined using sequence analyst programs from the Genetics Computer Group 
(Mac Vector). 

2. Tissue In Situ Hybridization 

To prepare normal sense and antisense probes, a unique 342 bp fragment from the 3' 
untranslated region (+3973 to +4314, counting the "A" of the initiator Met codon as +1; 
see "ish", FIG. 1) was subcloned into the pBSKS+ plasmid (Stratagene, Inc.). Template 
DNA was linearized with either EcoRI or BamHl, extracted, and precipitated with ethanol. 
Sense and antisense transcripts were generated from 1 mg template with T3 and T7 
polymerases in the presence of ['^S]UTP at >6 mCi/ml (Amersham, > 1200 Ci/mmol) and 
1.6 U/ml RNasin (Promega), with the remaining in vitro transcription reagents provided in a 
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kit (SureSite, Novagen Inc.). After transcription at 37°C for 1 h, DNA templates were 
removed by a 15 min. digest at 37°C with 0.5 U/ml RNase-free DNase I, extracted, and 
precipitated with ethanol. Riboprobes were hydrolyzed to an average final length of 150 bp 
by incubating in 40 mM NaHCOj. 60 mM NaaCO,, 80 mM DTT for -40 min. at 60°C. 
Hydrolysis was terminated by addition of sodium acetate, pH 6.0, and glacial acetic acid to 
.09 M and 0.56% (v/v), respectively, and the probes were then ethanol precipitated, 
dissolved in 0.1 M DTT, counted, and stored at -20°C untU use. Day 8.5-9.0, day 13.5, 
and day 16.5 mouse embryo tissue sections (Novagen) and the in situ hybridization protocol 
were exactly as described (Chen et al. , 1993; Yin et al. 1995). 

3. Northern Analysis 

MC3T3-E1 cell poly(A+) RNA (2-10 mg aliquots) was electrophoresed on a 1.25% 
agarose/2.2 M formaldehyde gel and then transferred to a nylon membrane (Hybond-N, 
Amersham). The RNA was cross-linked to the membrane by exposure to a UV Ught source 
(1.2 X 10<* mJ/cm\ UV Stratalinker 2400, Stratagene) and then pre-hybridized for > 15 min. 
at 65 °C in Rapid-Hyb buffer (Amersham, Inc.). A specific cDNA probe consisting solely of 
untranslated sequence from the 3' end of the transcript was '^P-labeled by random priming 
and used for hybridization (2 h at 65 °C). Blots were washed progressively to high 
stringency (O.IX SSC/0.1% SDS, 65 °C), and then placed against x-ray fihn with mtensifying 
screens (XAR, Kodak) at -86°C. 

4. Antibody Preparation 

LTBP-3 antibodies were raised against a unique peptide sequence found in domain #2 
(amino acids 155-167). Peptide #274 (GESVASKHAIYAVC) (SEQ ID N0:5) was 
synthesized using an ABI model 431 A synthesizer employing FastMoc chemistry. The 
sequence was confirmed using an ABI473 protein sequencer. A cysteine residue was added 
to the carboxy-terminus to facilitate crosslinking to carrier proteins. For antibody 
production, the synthetic peptide was coupled to rabbit serum albumin (RSA) using MBS 
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(m-maleimidobenzoic acid-N-hydroxysuccinimide ester) at a substitution of 7.5 mg peptide 
per mg of RSA. One mg of the peptide-RSA conjugate in 1 ml of Freund's complete 
adjuvant was injected subcutaneously at 10 different sites along the backs of rabbits. 
Beginning at 3 weeks after initial immunization, the rabbits were given bi-weekly booster 
injections of 1 mg peptide-RSA in 100 ul of Freund's incomplete adjuvant. IgG was 
prepared by mixing immune serum with caprylic acid (0.7 ml caprylic acid per ml serum), 
stirring for 30 min., and centrifuging at 5,000 X j for 10 min. The supernatant was 
decanted and dialyzed against two changes of phosphate buffered saline (PBS) overnight at 
4°C. The antibody solution was then affinity purified by passing it over a colunrn containing 
the immunizing peptide coupled to Affi-gel 10 affinity support. Bound antibodies were 
eluted with 0.2 M glycme (pH 2.3), immediately dialyzed against PBS, and concentrated to 1 
mg/ml. prior to storage at -70°C. 

5. Transfection 

Transient transfection was performed using standard protocols (Sambrook et al. , 
1989). Briefly, subconfluent cellsjcovering -20% of a 100 mm plastic tissue culture dish) 
were washed 2x in DMEM tissue culture medium (GIBCO) and then incubated for 3 hrs. at 
37°C in a sterile mixture of DEAE-dextran (0.25 mg/ml), chloroquine (55 mg/ml), and 15 
mg plasmid DNA (Courey and Tjian, 1988). Cells then were shocked by incubation with 
10% DMSO in sterile PBS for 2 min. at 37°C, washed 2x with DMEM (Sambrook et al, 
1989), and incubated in DMEM plus 10% fetal calf serum and antibiotics for 72 hr. at 37°C 

6. Immunoprecipitation 

For immunoprecipitation, 1 ml of antibody (1:400 final concentration, in PBS-TDS 
buffer: 0.38 mM NaCl, 2.7 mM KCl, 8.1 mM Na2HP04, 1.5 mM KH2PO4, 1% Triton 
X-100, 0.5% Deoxycholic acid, and 0.1% SDS) was added to 1 ml of radiolabeled medium 
proteins. The mixture was incubated with shaking at 4°C for 1 hr., protein A-sepharose 
CL-4B beads were added (200 ml, 10% suspension), and this mixmre was mcubated with 
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shaking for one additional hour at 4°C. Immunoprecipitated proteins were pelleted by brief 
centrifugation, the pellet was washed 6x with PBS-TPS buffer, 2x protein loading dye was 
added, and the samples were boiled for 5 min. and then fractionated on 4-18% gradient 
SDS-PAGE (Bonadio et al. 1985). Cold molecular weight markers (200 kDa-14.3 kDa, 
5 Rainbow mix, Amersham) were used to estimate molecular weight. The gel was dried and 
exposed to fihn for the indicated time at room temperature. 



7. Western Analysis 



in 



10 Fractionated proteins within SDS-polyacrylamide gels were transferred to a 

nitrocellulose filter for 2 hours using Tris-glycine-methanol buffer, pH 8.3 at 0.5 mA/cm^ 
The filter was blocked, incubated with nonfat milk plus antibody (1:1000 dilution) for 2 hr, 
and washed. Antibody staining was visualized using the ECL Western blotting reagent 
(Amersham) according to the manufacturer's protocols. 



m 

m 
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B. RESULTS 




In this study, the inventors isolated and characterized a novel murine fibrillin-like 
NA encoding LTBP-3\ To clone the murine LTBP-3 gene, cDNA from a 3T3 cell cDNA 
50 library was amplified using^uman fibrillin- 1 PGR™ primers under low stringency conditions 
5 {i-e., annealing at 37°C initiaV for 10 cycles, followed by annealing at eO^C for 30 cycles). 
The results indicated that a murine DNA fragment of unexpectedly low homology (-50%) 
to human fibrillin- 1 was obtained. Idolecular cloning of the authentic murine fibrillin- 1 
transcript was also performed, confimihig the human and murine fibrillin-1 coding sequences 
25 share >95% sequence identity. The murt^e fibrillin-1 and PGR™ sequences were different, 
which suggested that the PGR™ product maySiave been derived from a related, fibrillin-like 
cDNA. The 3T3 cell cDNA library was screen^ at high stringency using the murine PGR™ 
product as the probe in order to test this hypothesis A cDNA walking strategy eventually 
yielded seven overlapping cDNA clones (FIG. 1). It provides a unique mRNA of 4,314 
30 nucleotides, with an open reading frame of 3,753 nucle^des (SEQ ID N0:3) ,FIG. 9). The 
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deduced molecule is a unique polypeptide of 1,251 amino acids (SEQ ID N0:4) (FtG. 10\ 
Excluding the signal peptide (21 amino acids), the novel fibriUin-like molecule consists of 
five structurally distinct regions (Region 1- Region 5), and although simUar to murine 
fibrillin-1 (FIG. 2A), its domain structure is unique as is evidenced by the schematic 
5 representation of LTBP-3 shown in FIG. 2B. 

Domam #1 is a 28 an-ino acid seg nerit with a net basic charge (est pi, 12.36) that 
may allow for binding acidic molecules in the extracellular matrix (e.g., acidic 
proteoglycans). Sequences rich in basic amino acids may also function as endoproteolytic 
10 processing signals (Barr, 1991; Steiner et al. , 1992), which suggests that the NH^-terminus 
may be proteolytically processed. Domain #2 extends for 390 amino acids, consisting of an 
EGF-like repeat, a 135 amino acid segment that was proline-rich (20.7%) and glycine-rich 
,i (11.8%) but not cysteine-rich, a Fibmotif (Pereira et al, 1993), an EGF-CB repeat, and a 
5 TGF-bp repeat. Domain #3 is a 113 amino acid segment characterized by its high proline 
m content (21 %). Domain #4 extends for 678 amino acids and consists of 14 consecutive 
I cysteine-rich repeats. Based on strucmral homologies, 12/14 repeats were epidermal growth 
ffi factor-calcium binding (EGF-rB) motifs (Handfbrd et al. , 1991), whereas 2/14 were 
n transforming growth factor-/3-binding protein (TGF-bp) motifs (Kanzaki et al. , 1990). 

Finally, domain #5 is a 22 amino acid segment at the carboxy-terminus. The concepmal 
amino acid sequence encoded by the open reading frame consisted of 1,251 amino acids 
(FIG. 2B) with an estimated pi of 5.92, a predicted molecular mass of 134,710 Da, and five 
potential iV-linked glycosylation sites. No RGD sequence was present. 

Northern blot analysis of murine embryo RNA using a 3' untranslated region probe 
25 identified a transcript band of -4.6 kb. In this regard, 4,310 nt have been isolated by 

cDNA cloning, including a 3' untranslated region of 401 nt and a 5' upstream sequence of 
156 nt. The apparent discrepancy between the Northern analysis result and the cDNA 
sequence analysis suggested that the 5' upstream sequence may include -300 nt of additional 
upstream sequence. This estimate was consistent with preliminary primer extension mapping 
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studies indicating that the 5' upstream sequence is 400-500 nt in length. 

A total of 19 cysteine-rich repeats were found in domams #2 and #4 of the murine 
LTBP-like (LTBP-3) polypeptide. Thirteen were EGF-like and 11/13 contained the calcium 

5 binding consensus sequence. This consensus was derived from an analysis of 154 EGF-CB 
repeats in 23 different proteins and from stnictoral analyses of the EGF-CB repeat, both 
bound and unbo md to calcium ion (Selander Sunneihagen et al. , 1992). \^ariations on the 
consensus have been noted previously and one of these, D-L-N/D-E-C,, was identified in the 
third EGF-like repeat of domain #4. In addition, a potential calcium binding sequence which 

10 has not previously been reported (E-T-N/D-E-Ci) was identified in the first EGF-like repeat 
of domain #4. Ten of thirteen EGF-CB repeats also contained a second consensus sequence 
which represents a recognition sequence for an Asp/Asn hydroxylase that co- and 

J post-translationally modifies D/N residues (Stenflo et al. , mi; Gronke et al. , 1989). 



m 



Is Although about one-half the size, the deduced polypeptide was organized like fibrUlm- 

I 1 in that it consisted of a signal peptide followed by 5 structorally distinct domains, i.e. , two 
S domains with numerous EGF-like^EGF-CB and Fib repeats and a third with a proline-rich 
U sequence (Pereira et al. , 1993). However, comparison of each of these domains using the 
^ GAP and BESTFIT programs (Genetics Computer Group) has revealed a low level of amino 
fgO acid homology of only 27% over the five structural domains shared by the deduced murine 
polypeptide and human fibrillin-2. These values are low for a putative fibrillin family 
member because fibrillin-1 and fibrilIin-2 share -50% identity (Zhang et al., 1994). 



a 
□ 



A search of available databases revealed that the deduced murine polypeptide was 
25 most sunilar to the human and rat latent TGF-jS binding proteins (Kanzaki et al. , 1990; Tsuji 
et al. , 1990). In this regard LTBP-3 was found to be similar to fibrillin in that it could also 
be divided into five structurally distinct domains (FIG. 2A, FIG. 2B, and FIG. 2C). These 
include a relatively short domain downstream of the signal peptide with a net basic charge 
(amino acids 21-33, est. pi, 11.14); a domain consisting of EGF-like, EGF-CB, TGF-bp, and 
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Fib motifs plus a proline-rich and glycine-rich sequence (amino acids 34-407); a proline-rich 
domain (amino acids 408-545); a large, domain consisting of EGF-CB, TGF-bp, and TGF- 
bp-like repeat motifs (amino acids 546-1379); and a relatively short domain at the carboxy 
terminus (amino acids 1380-1394). Amino acid sequence comparison of the deduced murine 
5 and human polypeptides shows 60% identity for domain #1, 52% identity for domain #2, 
30% identity for domain #3, 43% identity for domain #4, and 7% identity for domain #5. 
The average idertity over the five domains shared by the murme polypeptide and human 
LTBP was 38.4%. Significanfly, cysteine residues in both polypeptide sequences were 
highly conserved. 

10 

The fibrillins are exclusively expressed by connective cells in developing tissues 
(Zhang et al, 1994), whereas LTBP should be expressed along with TGF-^ by both 
? epithelial and comiective cells (Tsuji et al , 1990). The structural homology data therefore 
fi predict that the murine LTBP-3 gene shown in FIG. 2B should be expressed by both 
m5 epithelial and comiective tissue cells. Tissue in situ hybridization was used to test this 
fl hypothesis. 

m 

p An overview of the expression pattern as determined by tissue in situ hybridization is 

5 presented m FIG. 3A, FIG. 3B, FIG. 3C, FIG. 3D, FIG. 3E and FIG. 3F. 

5 Approximate mid-sagittal sections of normal murine embryos at days 8.5-9.0, 13.5 

and 16.5 p.c. of development were hybridized with a ^^S-labeled single stranded normal 
sense riboprobe from the same cDNA construct was used. At day 8.5-9.0 of development, 
intense gene expression was observed in the mesometrial and anti-mesometrial uterine 
25 tissues, ectoplacental cone, placenta, placental membranes. The transcript appeared to be 

widely expressed in murine embryo mesenchymal/connective tissue compartments, includmg 
the facial mesenchyme, at days 8.5-9.0, 13.5 and 16.5 of development. Particularly intense 
expression of the transcript was noted in the liver. 
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Microscopy of day 8.5-9.0 embryos confirmed the widespread expression of the 
murine gene by mesenchymal cells. Significant expression of the transcript by cells of the 
developing central nervous system, somites and cardiovascular tissue (myocardium plus 
endocardiimi) was also observed. 

Microscopy of day 13.5 and day 16.5 embryos demonstrated expression of the murine 
gene by skeletal muscle cells and by cells involved in intramembranous and endochondral 
bone formation. The transcript was expressed by osteoblasts and by periosteal cells of the 
calvarium, mandible and maxilla. The transcript was also identified in both cartilage and 
bone of the lower extremity. A positive signal was detected in perichondrial cells and 
chondrocytes (proliferating > mature > hypertrophic) of articular cartilage, the presumptive 
growth plate, and the cartilage model within the central canal. The positive signal was also 
expressed by blood vessel endothelial cells within the mid-diaphysis, and the surrounding 
muscle cells. 

Respiratory epithelial cells lining developing small airways and connective tissue cells 
in the puhnonary interstitium ^xpressed the murine transcript, as did myocardial cells (atria 
and ventricles) and endocardial cushion tissue. Cells within the walls of large arteries also 
expressed the transcript. Expression of the murine gene was identified in several organs of 
the alimentary system, including the tongue, esophagus, stomach, small and large intestine, 
pancreas and liver. Mucosal epithelial cells lining the upper and lower digestive tract plus 
the smooth muscle and connective tissue cells found in the submucosa expressed the 
transcript, as did acinar cells of the exocrine pancreas. Despite the high level of transcript 
expression in the liver, these results suggest both cell populations express the LTBP-3 
transcript. 

In the kidney, expression above the basal level was observed in cells of developing 
nephrons, the ureteric bud, kidney blastema and the kidney interstitium. In the skin, 
epidermal and adnexal keratinocytes, dermal connective tissue cells, and brown fat cells 
within the dorsal subcutis expressed the murine transcript. In the central and peripheral 
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nervous systems, ganglion ceUs within the cerebrum, brainstem, spinal cord, and peripheral 
nerves expressed the murine transcript. The transcript was also intensely expressed by cells 
of the developing murine retina. 

5 Thus, the murine gene is widely expressed by both epithelial and connective tissue 

cells, a pattern that would be expected for a latent TGF-0 binding protein. Three final 
observations argue that the LTBP-like (LTBP-3) sequence presented herem is not simply the 
murine homologue of human LTBP. First, domain #4 of the murine LTBP-like (LTBP-3) 
sequence has a smaller number of EGF-like repeat motifs than human and rat LTBP (8 
10 versus 11). Second, portions of the human and rat LTBP-like coding sequence were 

characterized and found to share -90% identity with human and rat LTBP but only 65% 
identity with the murme LTBP-like gene. Third, the human LTBP and LTBP-like genes are 
5 localized to separate chromosomes. Human LTBP was assigned to human chromosome 2 
i based on the analysis of human x rodent somatic cell hybrid lines (Stenman et al. , 1994). 
^ The present invention represents the first mapping of an UTBP gene in the murine. The 

human LTBP-like genes was recently localized to chromosome 11 band ql2, while the 
murine gene was mapped to murine chromosome 19, band B (a region of conserved 
synteny), using several independent approaches, including fluorescent in situ hybridization. 



if 



m 
tfl 

in 



01 

5o The first indication of alternative splicing came from molecular cloning studies in the 

i murine, in which independent cDNA clones were isolated with a deletion of 51 bp from the 
^ coding sequence. PCR™/Southem blot analysis provided additional evidence that the 
homologous 51 bp sequence was alternatively spliced in normal murine embryo tissues. 

25 Northern blot analysis also demonstrated that the novel fibrillin gene was also 

expressed in rat callus three weeks after osteotomy, after mineralization has begun. Gene 
expression in normal adult rat bone tissue was insignificant, which suggests that microfibrils 
are an important part of the bone fracture healing response. The novel fibrillin-like gene was 
expressed in callus as a pair of alternatively spliced transcripts. This resuU has been 

30 independently reproduced on three occasions. Molecular cloning of the novel fibrillin gene 
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in both murine and rat has identified potential splice junction sites for the alternative splicing 
event. 

MC3T3-E1 murine pre-osteoblasts were used to demonstrate that the murine gene 
5 product was capable of binding TGF-/3. MC3T3-E1 cells were utilized because they 
synthesize and secrete TGF-|3, which may act as an autocrine regulator of osteoblast 
proliferation (Amamani et al., 1993; Van Vlasselaer et ai, 1994; Lopez-Casillas et al, 
1994). 

10 To determine whether or not MC3T3-E1 cells co-expressed the murine gene product 

of TGF-|3, cells were plated on 100-mm dishes under differentiating conditions (Quarles 
et al, 1992) and the medium was replaced twice weekly. Parallel dishes were plated and 
5 assayed for cell number and alkaline phosphatase activity, which confirmed that osteoblast 
S differentiation was indeed taking place. Equal aliquots of total cellular RNA was prepared 

from these MC3T3-E1 cells after 5, 14 and 28 days in culture for Northern blot analysis. As 
shown m FIG. 4, expression of the new murine gene peaked on day 14 of culture. Since 
MC3T3-E1 cells also show a peak in alkaline phosphatase activity on day 14 of culture 
(Quarles et al, 1992), the results suggest for the first time that LTBP-3 gene expression is 
fl an early marker of osteoblast differentiation. 

m 

□ C. DISCUSSION 

This smdy reports the molecular cloning of a novel LTBP-3 gene that contains 
numerous EGF-like repeats. Northern analysis indicates that the gene encodes a single 

25 transcript of -4.6 kb in murine embryo tissues. The deduced amino acid sequence of the 
murine gene product appears to be a secreted polypeptide of 1,251 amino acids. Although it 
is similar to fibrillin, the overall structural organization and expression pattern of this gene 
product most resembles LTBP, a latent TGF-)3 binding protein that was originally isolated 
and characterized by Heldin and co-workers (Kanzaki et al, 1990). Several observations 

30 strongly suggest that LTBP and the murine LTBP-like gene product are therefore derived 
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from related but distinct genetic loci. First. LTBP and the LTBP-like coding sequence share 
-40% identity and differences exist in the number of EGF-CB repeats in the deduced 
polypeptide sequence of the two molecules. Second, a portion of the murine LTBP gene has 
been cloned and shown to share -90% identity with human and rat LTBP. Conversely, 
portions of the human and rat LTBP-like genes have been cloned and shown to share -90% 
identity with the murine LTBP-like gene. Third, LTBP and the LTBP-like gene reside on 
different human chromosomes (Stenman et ai, 1904). Taken together, these data suggest 
that a family of at least two LTBP genes exists. 

Similarities in the structural organization of LTBP-1 and the fibrillin-1 and fibrillin-2 
polypeptides have been noted previously (Pereira et ai, 1993; Zhang et al, 1994; Taipale 
et ai, 1994). For example, LTBP-1 and the fibrillins are all secreted extracellular matrix 
constituents. Moreover, each polypeptide can be organized into five domains, two of which 
consists predominantly of EGF-CB and TGF-bp repeat motifs. LTBP-1 and fibrillin-1 also 
share a domain that is proline-rich, and LTBP possesses an 8-cysteine repeat previously 
referred to as the "Fib motif" because it was assumed to be unique to fibrillin (Pereira et al., 
1993). These similarities likely explain the initial isolation and cloning of the LTBP-3 PGR™ 
product, especially since the human oligonucleotide primers used to initially amplify murine 
cDNA were designed to direct the synthesis of an EGF-CB repeat in domain #4. 

Another point of distinction between LTBP-3 and fibrillin concerns the spacing of 
conserved cysteines C4 and C5 in EGF-like repeats. Fibrillin-1 and fibrillin-2 each contain 
>50 such repeats, and in every one the spacing is C4-X-C5. While this pattern is repeated in 
a majority of the EGF-like repeats in LTBP-1 and LTBP-2, both genes also contain repeats 
with the spacing C4-X-X-C5. Although the significance of this observation is unclear, 
variation in the number of amino acids between C4 and C5 would not be expected to alter the 
function of the EGF-like repeat. Mature EGF is a 48 amino acid secreted polypeptide 
consisting of two subdomains that have few interdomain contacts (Engel, 1989; Davis, 1990). 
The larger NHz-terminal subdomain consists of residues 1-32 and is stabilized by a pair of 
disulfide bonds (C1-C3 and C2-C4), whereas the smaller COOH-terminal subdomain (amino 
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acids 33-48) is stabilized by a single disulfide bond (Cs-C^. The COOH-terminal subdomain 
has a highly conserved conformation that only is possible if certam residues and the distances 
between them are well conserved, while conformation-sequence requirements for the NH2- 
terminal subdomain are relatively relaxed. Variation in C4-C5 spacmg would not be expected 
to alter conformation because these residues do not normally form a disulfide bond and the 
spacing variation occurs at the interface of subdomams that would not be predicted to 
interact. The -loning of additional genes will decide whether variation in C4-C5 spacing is a 
reliable discriminator between members of the LTBP and fibrUlin gene families. 

The LTBP-3 gene is expressed more widely during development than fibrillin- 1 or 
fibrillin-2. Studies in developing murine tissues have shown that the Fbn-1 gene is expressed 
by mesenchymal cells of developing connective tissue, whereas the murine LTBP-like gene is 
intensely expressed by epithelial, parenchymal and stromal cells. Earlier reports have 
suggested that TGF-^ plays a role in differentiation and morphogenesis during murine 
development (Lyons and Moses, 1990), when TGF-/3 is produced by epithelial, parenchymal 
and stromal cells. Tsuji et al., (1990) and others have suggested that the expression of TGF- 
18 binding proteins should mirror that of TGF-j8 itself; the expression pattern of the LTBP-3 
gene over the course of murine development is consistent with this expectation. However, 
the LTBP-3 gene may not be completely co-regulated with TGF-/3. TGF-^ gene and protein 
expression during murine development has been surveyed extensively (Heine et al., 1987; 
Lehnert and Akhurst, 1988; Pelton et al., 1989; Pelton et al., 1990a, b; Millan et al., 1991); 
these studies have not identified expression by skeletal muscle cells, chondrocytes, 
hepatocytes, ganglion cells, mucosal cells lining the gut, and epithelial cells of developmg 
nephrons. It is conceivable that the LTBP-3 molecule has an additional function in certain 
connective tissues besides targeting TGF-18. 

The binding properties of the LTBP-3 gene product are under investigation. 
Formally, the LTBP-3 polypeptide may bind a specific TGF-/3 isoform, another member of 
the TGF-18 superfamily (e.g., a bone morphogenetic protein, inhibin, activin, or MuUerian 
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inhibiting factor), or a growth factor unrelated to TGF-/3. Anti-peptide antibodies to the 
murine LTBP-3 polypeptide have been generated and osteoblast cell lines that express the 
molecule at relatively high levels have been identified. Studies with these reagents suggest 
that LTBP-3 assembles intracellularly into large latent complexes with a growth factor that is 
5 bemg characterized by immunological methods. 

Ttie presence of dibasic amino aci Is in the LTBP-S sequence sugge ts that it may 
undergo cell- and tissue-specific proteolysis. TGF-j8 regulates extracellular matrix 
production by suppressing matrix degradation (through a decrease m the expression of 
10 proteases such as coUagenase, plasminogen activator, and stromelysin plus an mcrease in the 
expression of proteinase inhibitors such as plasmmogen activator inhibitor- 1 and tissue 
inhibitor of metalloproteinase-1) and by stimulating matrix macromolecule synthesis (for 
5 recent review, see Lyons and Moses, 1990; Massague, 1990; Laiho and Keski-Oja, 1992; 
i Miyazono et al. , 1992). Conversely, production of extracellular matrix has been shown to 
'U5 down regulate TGF-|8 gene expression (Streuli et al. , 1993). TGF-/3 may therefore regulate 
J extracellular matrix production through a sophisticated feedback loop that influences the 
5 expression of a relatively large number of genes. LTBP-1 and LTBP-2 may contribute to 
O this regulation by facilitating the assembly and secretion of large latent growth factor 
complexes and then targeting the complex to specific connective tissues (Taipale et al. , 
1994). 



m 



If LTBP-3 is like LTBP-1, it has the potential to function as a secreted, extracellular 
structural protein. As demonstrated here, domain #1 of LTBP-3 appears to be a unique 
sequence that likely has a globular conformation. Domain #1 also is highly basic and may 
25 facilitate LTBP-3 binding to acidic molecules (e.g. , acidic proteoglycans) within the 

extracellular space. Sequences rich in basic amino acids have also been shown to function as 
endoproteolytic processing signals for several peptide hormones (Barr, 1991; Steiner et al. , 
1992). It is possible, therefore, that the NHj-terminus of LTBP-3 is proteolytically 
processed in a tissue-specific manner. 



-71- 



Domains #2 and #4 consist of consecutive cysteine-rich repeats, the majority of which 
are of the EGF-CB type. Besides binding calcium (Corson et al , 1993), these repeats may 
provide LTBP-3 with regions conformation capable of interacting with other matrix 
macromolecules (Engel, 1989). Domain #3 is proline rich and may be capable of bending 

5 (or functioning like a hinge) in three-dimensional space (MacArthur and Thornton, 1991). 
(In this regard, domain #2 is of interest because it has a similar stretch of 135 amino acids 
that is boai prolue- and glycme-rich. Since glycine-rich sequences are also thought to be 
capable of bending or functioning like a hinge in three-dimensional space, this amino acid 
sequence may interrupt the extended conformation of domain #2, thereby providing it with a 

10 certain degree of flexibility in three-dimensional space.) 

Domain #5 also appears to be a unique sequence having a globular conformation. 
O The absence of a known cell attachment motif may indicate that, in contrast to LTBP-1, the 
fi LTBP-3 molecule may have a more limited role m the extracellular matrix (i.e. , that of a 
lis stmcmral protein) in addition to its ability to target latent TGF-^ complexes to specific 

f: connective tissues. 

ffil 

Hi _^ 

MC3T3-E1 pre-osteoblasts co-express LTBP-3 and TGF-/31 and these proteins form a 
ffi complex in the culture medium. These results are particularly interesting because bone 
mo represents one of the largest known repositories of latent TGF-/3 (200 /xg/kg bone; Seyedin 
S et al. , 1986 and 1987), and because this growth factor plays a critical role in the 

determination of bone strucmre and function. For example, TGF-^ is thought to (i) provide 
a powerful stimulus to bone formation in developing tissues, (ii) function as a possible 
"coupling factor" during bone remodeling (a process that coordinates bone resorption and 
25 formation), and (iii) exert a powerful bone inductive stimulus following fracture. Activation 
of the latent complex may be an important step governing TGF-^ effects, and LTBP may 
modulate the activation process (e.g., it may "protect" small latent complexes from 
proteolytic attack). 

30 Expression of large latent TGF-/3 complexes bearing LTBP may be physiologically 
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relevant to, i.e., may be part of the mechanism of, the pre-osteoblast -♦ osteoblast 
differentiation cascade. This is based on the evidence that MC3T3-E1 cells express large 
latent TGF-jSl complexes bearing LTBP-2 precisely at the time of transition from the 
pre-osteoblast to osteoblast phenotype (-day 14 in culture, or, at the onset of alkaline 
5 phosphatase expression; see Quarles et al. , 1992). The organ culture model, for example, 
likely is comprised of differentiated osteoblasts but few bond progenitors, making it a 
difficult model i: best in which to study die differentiation cascade (Dallas et al. , 1984). It 
is also well known that MG63, ROS17/2.8 and UMR 106 cells are rapidly dividmg and they 
express the osteoblast phenotype. Thus, these osteoblast-like cell lines do not show the 
10 uncoupling of cell proliferation and cell differentiation that characterizes the normal 

(physiologically relevant) pre-osteoblast -* osteoblast transition (Gerstenfeld et al. , 1984; 
Stein and Lian, 1993). Therefore, the production of small versus large latent TGF-/3 
complexes may be associated with specific stages in the maturation of bone cells. 



Jis LTBP-3 may bind calcium, since EGF-CB repeats have been shown to mediate high 

2 affinity calcium binding in LTBP-1 and other proteins (Colosetti et al., 1993). Calcium 

|j| binding, in turn, may contribute to molecular conformation and the regulation of its 

fn interactions with other molecules. The presence of dibasic amino acids suggests that LTBP-3 
may also undergo cell- and tissue-specific proteolysis. TGF-|3 regulates extracellular matrix 

PlEO production by suppressing matrix degradation (through a decrease in the expression of 

S proteases such as coUagenase, plasminogen activator, and stromelysin plus an increase in the 

Titer 

expression of proteinase inhibitors such as plasminogen activator inhibitor-l and tissue 
inhibitor of metalloproteinase-l) and by stimulating matrix macromolecule synthesis (for 
recent reviews, see Lyons and Moses, 1990; Massague, 1990; Laiho and Keski-Oja, 1992; 
25 and Miyanzono et al. , 1993). Conversely, production of extracellular matrix has been shown 
to down regulate TGF-jS gene expression (Streuli et al. , 1993). TGF-/3 may therefore 
regulate extracellular matrix production through a sophisticated feedback loop that influences 
the expression of a relatively large number of genes. LTBP-1, LTBP-2, and LTBP-3 may 
contribute to this regulation by facilitating the assembly and secretion of large latent growth 
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factor complexes and then targeting the complex to specific connective tissues (Taipale et al. , 
1994). 

EXAMPLE n 
PREPARATION OF LTBP-3 ANTmODIES 

An affinity-purified antibody (#274) capable of immunoprecipitatmg was prepared 
against the murine LTBP-3 gene product. Full-length murine cDNAs were assembled into 
the pcDNA3 mammalian expression vector (Invitrogen) and expressed following transient 
transfection of 293T cells. Nascent polypeptides, radiolabeled by addition of ^^S Cys to the 
medium of transfected cells, were immunoprecipitated using affmity-purified antibody #274. 
As shown in FIG. 5, the new murine polypeptide was estimated to be 180-190 kDa. To 
ensure the specificity of #274 binding, preincubation with 10 ng of synthetic peptide was 
shown to block immunoprecipitation of the 180-190 kDa band. 

Finally, MC3T3-E1 cells were cultured for 7 days under differentiating conditions and 
double-labeled with 30 ^iCi/ml ^^S cysteine and '^S methionine in deficient media. 
Radiolabeled media was dialyzed into cold PBS with protease inhibitors. Aliquots of the 
dialyzed medium sample (10* incorporated CPM) were analyzed by a combined 
immunoprecipitation/Westem analysis protocol. The murine polypeptide was clearly and 
reproducibly secreted by MC3T3-E1 cells, migrating under reducing conditions as a single 
band of 180-190 kDa (FIG. 6). Consistent with the results of previous studies (e.g. , 
Miyazono et al, 1988; Dallas et al, 1994; Moren et al, 1994), bands of 70 and 50 kDa 
corresponding to the TGF-j81 precursor were co-imraunoprecipitated with the 180 kDa 
LTBP-3 protein. Weak bands of 40 and 12 kDa were also identified in experiments in which 
only immunoprecipitation was performed. The latter were not included in FIG. 6 because 
they migrated within that portion of the gel included in the Western analysis. Protein bands 
of 70-12.5 kDa are not variant forms of LTBP-3; FIG. 5 demonstrates that LTBP-3 migrates 
as a single band of 180-190 kDa following transient transfection of 293T cells, which fail to 
make TGF-/31. By immunoprecipitation, a unique band consistent with monomeric mature 
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TGF-/31 was found in the LTBP-2 immunoprecipitate. Antibody #274 is incapable of 
binding TGF-^1 as determined by radioimmunoassay using commercially available reagents 
(R&D Systems) and the manufacturer's suggested protocols. These results have been 
reproduced in 6 independent experiments which utilized 3 separate lots of MC3T3-E1 
medium. Thus the new murine LTBP-3 polypeptide binds TGF-/3 in vitro. 



^ ^ ^ co-tranf '^ion studies of 293T Cells using pLTBP-3fl^dpTGF-01, 

^(^precipitation\LTBP-3 and TGF-/31 was demonstrated W 293T cells following 
"tonsient transfection andWiolabeling. Aliquots (-10^ incorporated CPM) of radiolabeled 
10 media were immunoprecipi^d and separated using 4%-18% gradient ST^S^PAGE and either 
reducing or nonreducing condm^^ns as described (Yin et al., 1995) ^FIG. 13). 



IJ1 
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EXAMPLE in 
ISOLATION OF A GENE ENrODTNG MT TRINE LTBP-2 



In addition to determining the DNA and corresponding polypeptide sequence of the 
murme LTBP-3 gene, the murine LTBP-2 gene was also cloned and sequenced. 



m 




25 



30 



1. Cloning and DNA sequencing of LTBP-2 

o identify iW murine LTBP family members, independently designed degenerate 
l^ucleotide primer^ere synthesized based on a structural homology shared by human 
t^TBP-1 and mouse LTBPK3 coding sequences: forward primer, 
5'-AAACGTCACACGTGAkcGTGAACGTTGCTTGCTGG-3' (SEQ ID NO: 12); reverse 
primer, 5'-TTACGTCCACGtXcACGTCTAGCAAGCAAGCA-3' (SEQ ID NO: 13), and 
then used PGR™ to amplify singleV^randed mouse embryo cDNA prepared from normal CD- 
1 mouse embryo mRNA. A band oK^Pproximately 400 basepairs (bp) was isolated and 
purified by agarose gel electrophoresisVe DNA was ligated into the TA cloning vector 
(InVitrogen), and the ligation mixmre wa^sed to transform competent bacteria. Plasmid 
DNA (from 28 colony forming units) was prWred and evaluated by DNA sequence analysis. 
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As determined\y sequence identity comparison, 16/28 plasmid DNAs coded for mouse 
LTBP-1, 11/28 c^ed for mouse LTBP-3, and 1/28 coded for an apparently unique sequence. 
The insert DNA froiki the unique plasmid was then used as a probe to screen a cDNA library 
prepared from 3T3 celhuStratagene, Inc.). A walking strategy eventually yieldgdJhe- 
overlapping cDNA clonesNshown in /FIG. Analysis of these clones identified an open 
reading frame of 5,430 baseWirs. Coifiparison of sequence identity using the GAP and 
BESTFIT programs (Genetics Computer Group) revealed 79.7% identity between the mouse 
open reading frame and human U^P-2 (Centrella et al, 1991), but <. 47.1% identity 
between the mouse open reading frabe and human LTBP-1 and mouse LTBP-3. The 
sequence comparison data agreed withVhromosomal localization data, which collectively 
established that the sequence was the mo^e homolog of hiraian LTBP-2. 

The level of amino acid sequence identity (approximately 40%) among the LTBP-1, - 
2, and -3 polypeptides is in die range observed for other protein isoforms that contain 
multiple EGF-like repeats, like the fibrillins (Yin et al. , 1995) and the diverse laminin chains 
(Engel, 1989). 

An LTBP-2 methionine codon in a favorable context for translation initiation was 
provisionally designated the translation start site (see Kozak, 1991). The deduced initiator 
methionine was followed by a signal sequence of approximately 35 amino acids. Consistent 
with the strucmre and length of the human LTBP-2 signal peptide (Centrella et al. , 1991), 
the 15 residues immediately downstream of the mouse LTBP-2 initiator methionine were 
largely hydrophilic in nature, whereas amino acids 16-35 represented a typical hydrophobic 
signal peptide sequence. The small neutral amino acid residues Ser (-3) and Ala (-1) and the 
large polar residue Gin (+1) appeared to defme the signal peptide cleavage site (von Heijne, 
1983). 

In contrast to LTBP-1 and -3, which appear to be organized into 5 strucmrally 
distinct domains downstream of the signal peptide (Yin et al. , 1995), the deduced mouse 
LTBP-2 polypeptide consists of ten alternating struchiral domains that are composed of either 
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proline- and glycine-rich sequences or cysteine-rich repeat motifs. Thus, domain 1 (amino 
acids 36-160) was composed of 19.4% glycine and proline residues. Domain 2 (amino acids 
■ 161-213) consisted of 2 EGF-like repeats. Domain 3 (amino acids 214-344) was composed 
of 22.3% glycine and proline residues. Domain 4 (amino acids 345-413) consisted of 2 

5 cysteine-rich repeats. Domain 5 (amino acids 414-536) was composed of 19.5% glycine and 
proline residues. Domain 6 (amino acids 537-708) consisted of 3 cysteine-rich repeats; based 
on structural homologies, the first repeat was a Fib motif (Pereira et al. , 1993; a copy of this 
reference has been included in the Appendix), the second was an epidermal growth factor- 
calcium binding (EGF-CB) motif (Handford et al. , 1990), and the third was a transforming 

10 growth factor-jSl-binding protein (TGF-bp) motif (Kanzaki et al. , 1990). Domain 7 (amino 
acids 709-831) was composed of 20.3% proline and glycine residues. Domain 8 (amino 
acids 832-1626) consisted of 15 EGF-like repeats and 2 TGF-bp repeats. Domain 9 (amino 

1 acids 1627-1721) was composed of 29.5% glycine and proline residues. Domain 10 (amino 
UI acids 1722-1810) consisted of 2 EGF-like repeat motifs. 

2 The conceptual mouse LTBP-2 amino acid sequence consists of 1,810 amino acids, 

S with an estimated pi of 5.02, a predicted molecular mass of 197,917 Da. , and eight potential 
h N-lmked glycosylation sites. Similar to the mouse LTBP-3 polypeptide, RGD and laminin 
B2 chain cell adhesion sequences were not identified. Altogether, 26 cysteine-rich repeats 
mo were found in the mouse LTBP-2 polypeptide. As described above, 20/26 were 
O characterized by the presence of 6 cysteine residues and therefore were EGF-like. 12/20 

showed the general consensus D/N-I/V-D/N-E/D-Ci, derived from an analysis of 154 EGF- 
like repeats in 23 different proteins and from structural analysis of the coagulation factor X 
EGF-CB repeat, both bound and unbound to calcium ion (Selander-Sunnerhagen et al. , 
25 1992). Variations on the consensus have been noted previously (for examples see Yin et al. , 
1995; Yin et al., 1995), and two of these, were identified in mouse LTBP-2. Two potential 
variants which have not previously been reported were also identified (D-A-D-E-Ci and D-H 
N-E-Ci), giving a total of 16 putative EGF-CB repeats. All 16 repeats also contained a 
proposed recognition sequence (C3-X-D/N-X-X-X-X-Y/F-X-C4) for an Asp/Asn hydroxylase 
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that co-and posttranslationally modifies D/N residues (Stenflo et al., 1987; Gronke et al, 
1989). Previous NMR studies of the isolated first EGF-like domain of human factor IX 
indicate that 3 residues derived from the general consensus and from the recognition 
sequence are direct ligands for calcium (Handford et al, 1990; 1995; 1991). This has led to 
a proposed calcium-binding consensus D/N, D/N, D*/N* (where * denotes a_/3-hydroxylated 
residue). A fourth residue in the consensus as originally proposed, F/Y, is now known not 
to be a direct ligand for calcium (Hughes et al. , 1993). The three amino acids that are direct 
ligands for calcium in factor IX are conserved in each of the 16 putative EGF-CB repeats 
identified in mouse LTBP-2. 



Four of five of the putative domains rich in proline and glycine were also rich in 
basic amino acid residues (domam 1, 15.5%; domain 3, 13.1%; domain 5, 11%; domain 7, 
8.1%; and domain 9, 3.2%). Co-existence of proline and basic amino acids suggests the 
possibility that LTBP-2 undergoes 'proline-directed' endoproteolytic processing (Devi, 1991). 
15 Indeed, both monobasic and dibasic cleavage motifs — e.g., R-R and R-X-X-R, respectively 

(Barr, 1991)— were identified in all five posmlated proline- and glycine-rich structural 
^ domains. In several instances, monobasic cleavage motifs occurred near or witiiin potential 
dibasic cleavage motifs {e.g., Arg 108, Arg 286, Arg 429, and Lys 727). One potential 
monobasic cleavage motif was identified in each of the 5 proline- and glycine-rich domains. 
Dibasic cleavage motifs in general were more prevalent near the deduced LTBP-2 amino 
terminus. Endoproteolytic cleavage of LTBP is of potential interest because it may help 
explain the smaller than expected size of platelet LTBP-1. 



0 contRlete cDNA nucleotide sequence for murine LTBP-2 is shown in (SEQ ID 

2? ITO:1) (FIG. 7). deduced amino acid sequence is shown in (SEQ ID N0:2) ^FIG. 8). 

2. Mouse Ubp-2 Gene Expression in Developing Perichondrium 

he inventors have den^nstrated that ltbp-3 is widely and intensely expressed in both 
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developing maternal tissues {e.g., uterine decidua) and mouse embryo tissues {e.g., 
mesenchyme, connective tissue, epithelia, and parenchyma). Tissue in situ hybridization was 
used to compare and coflttast the developmental expression of ltbp-2 and ltbp-3^ FIG. 12A. 
FIG. 12B, and FIG 12C present an overview of ltbp-2 expression in a mid-sagittdHcctTon of 
a mouse embryo at day 16.5 p. c. of development, when expression is strongest. The section 
was hybridized with a '^S-labfeled single stranded antisense riboprobe synthesized from a 580 
base pair cDNA coding for thd mouse LTBP-2 3' u itranslated region. The probe showed 
<30% sequence identity with tfte 3' untranslated sequences of human ltbp-1 and ltbp-2, 
which is too low to give spurious\hybridization signals under our conditions. A ^^S-labeled 
single stranded normal sense ribopsobe from the same cDNA construct was used as a 
negative control, ltbp-2 expression above background was observed in the snout, base of the 
skull, tail, paw, lung, vertebrae, andVarge vessels of mouse embryos. Microscopy of day 
16.5 p.c. embryo tissue sections, take* from the same slide used to prepare whole mount 
sections shown in flG. 12A, demonstr^ed that the pattern of hybridization was due to 
significant ltbp-2 geiie expression by perWondrial and vascular wall cells. Positive signals 
were detected, for example, in perichondral cells of cartilage aggregates located in the 
vertebral column (v), forelimb and tail, and at the base of the skuU (FIG. 12B). Indeed, the 
perichondrium (pc) of all cartilage aggregate's observed in these moiise embryo tissue sections 
was positively hybridized, ltbp-2 was also expressed by vascular wall cells of the aorta (ao), 
and in blood vessels within lung parenchyma knd within the connective tissue supporting hair 
follicle structures associated with the snout (s).\ In contrast, ltbp-2 was expressed at 
insignificant levels {i.e., below the experimental background) in the generalized 
mesenchyme/connective tissue, brain, peripheral Wrve, tooth rudiment, lung epithelium, 
cardiac and skeletal muscle, gut epithelium, liver barenchyma, pancreas epithelium and islets 
of Langerhans, brown fat cells, and kidney parencRvma. These in situ hybridization results, 
which were reproduced using independent tissue sections, demonstrate for the first time that 
ltbp-2 expression in developing mouse tissues is mord restricted than that of ltbp-3. 

' ' 3. Chromosomal Localization of Mouse ltbp-2 Gene 
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The murine ltbp-2 gene was assigned to a mouse chromosome by PGR™ analysis of 
genomic DNA from a mapping panel consisting of 19 mouse x Chinese hamster and 1 mouse 
X rat somatic cell hybrid lines as described (Li et al. , 1995). A PGR™ product of the 
expected size (600 bp) was obtained from hybrid cells that had retained mouse chromosome 
12. All other mouse chromosomes (except chromosome 12) were excluded by at least four 
discordant hybrids. Fluorescent in situ hybridization using murine ltbp-2 genomic and cDNA 
probes generate^ identical results that localized the ltbp-2 gene to mouse chromosome 12, 
band D. 14/20 metaphase spreads analyzed exhibited a fluorescent signal on both chromatids 
of chromosome 12 at the band D site, and 10/12 had signals on both chromosome 12 
homologs. No specific signals were seen on other chromosomes. This region in the mouse 
is a conserved syntenic region with human chromosome 14, band 14q24, the site of the 
human LTBP-2 genetic locus (Moren et al. , 1994), thereby providing strong support for the 
notion that ttie murine ltbp-2 gene is the true homolog of human LTBP-2. 

EXAMPLE IV 
EXPRESSION OF RECOMBINANT LTBP PROTEIN 

The Pichia Expression Kit (Invitrogen, Inc.) may be used to prepare recombinant 
LTBP protein. This kit, based on the metiiylotrophic yeast, Pichia pastoris, allows high- 
level expression of recombinant protein in an easy-to-use relatively inexpensive system. In 
the absence of the preferred carbon source, glucose, P. pastoris utilizes methanol as a carbon 
source. The AOXl promoter controls the gene that codes for the expression of the enzyme 
alcohol oxidase, which catalyzes the first step in the metabolism of methanol. This 
promoter, which is induced by methanol, has been characterized and incorporated into a 
series of Pichia expression vectors. This feature of Pichia has been exploited to express high 
levels of recombinant proteins often in the range of grams per liter. Because it is eukaryotic, 
P. pastoris utilizes posttranslational modification pathways that are similar to those used by 
mammalian cells. This implies that the recombinant LTBP-2 or LTBP-3 protein will be 
glycosylated and will contain disulfide bonds. 
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For preparation of a recombinant LTBP-2 or LTBP-3 protein, the native LTBP-2 or 
LTBP-3 cDNA is modified by the addition of a commercially available epitope tag (the HA 
epitope, Pharmacia, LKB Biotechnology, Inc.). Such fragments may be readily prepared by, 
for example, directly synthesizing the fragment by chemical means, by application of nucleic 
5 acid reproduction technology, such as the PGR™ technology of U.S. Patent 4,683,202 (herein 
incorporated H3 reference) or by introducing selected sequences into recombinant vectors for 
recombinant production. (PGR™ is a regi tercd trad<;mark of Hoffmann-L£ loche, Inc.). This 
is followed by clonmg into the Pichia expression vector. The resulting plasmid is 
characterized by DNA sequence analysis, linearized by digestion with Notl, and spheroplasts 
10 will be prepared and transformed with the linearized construct according to the 
manufacturer's recommendations. 

y Transformation facilitates a recombmation event in vivo between the 5' and 3' AOXl 

W sequences in the Pichia vector and those in the Pichia genome. The result is the replacement 

f\3.5 of AOXl with the gene of interest. 

Cfi 

yl Transformants are then plated on histidine-deficient media, which will select for 

b successfully transformed cells. Transformants are further selected against slow growth on 
J growth media containing methanol. Positive transformants are grown for 2 days in liquid 
TC20 culmre and then for 2-6 days in broth that uses methanol as the sole carbon source. Protein 
5 expression is evaluated by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS- 
PAGE) and Western hybridization using a commercially available polyclonal antisera to the 
HA epitope (Pharmacia). Recombinant LTBP-2 or LTBP-3 protein may be purified 
according to the manufacturer's recommendations, dialyzed against double distilled, 
25 deionized water and lyophilized in 10 mg aliquots. 
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All of the compositions and methods disclosed and claimed herein can be made and 
executed without undue experimentation in light of the present disclosure. While the 
5 compositions and methods of this invention have been described in terms of preferred 

embodiments, it will be apparent to tiiose of skill in the art that variations may be applied to 
tiie composition, methods and in the steps or in the sequence of steps of the method 
described herein without departing from the concept, spirit and scope of the invention. More 
specifically, it will be apparent that certain agents which are both chemically and 

10 physiologically related may be substituted for the agents described herein while the same or 
similar results would be achieved. All such similar substitutes and modifications apparent to 
those skilled in the art are deemed to be within the spirit, scope and concept of the invention 

Jfi as defined by the appended claims. 
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GENERAL INFORMATION: 

(i) APPLICANT: Bonadio, Jeffrey 
Yin, Wusham 

(ii) TITLE OF INVENTION: LATENT TGFj3 BINDING PROTEIN (LTBP) GENES, 

CONJPOSITIONS AND METHODS 

(iii) NUMBER OF SEQUENCES: 13 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Arnold, 

(B) STREET: P.O. Box 443 

(C) CITY: Houston 

(D) STATE: Texas 

(E) COUNTRY: United State 

(F) ZIP: 77210 



ite Sc Durkee 



of America 



as 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy dif 

(B) COMPUTER: IBM PC compadible 

(C) OPERATING SYSTEM: PC -DOB /MS -DOS /AS CI I 

(D) SOFTWARE: Patent In Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: Unkfaown 

(B) FILING DATE: Concurrent!^ Herewith 

(C) CLASSIFICATION: Unknown 



"35 



to 



45 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US PC'IYUS95/02251 

(B) FILING DATEj 21-FEB-1995 
(Cl CLASSIFICATION: Unknown 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/3^^6,650 
(3) FILING DATE: 30-SEP-1994 

(C) CLASSIFICATION: Unknown 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/199^780 

(B) FILING DATE: 18-FEB-1994 

(C) CLASSIFICATION: Unknown 
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(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Fussey, Shelley P.M. 

(B) REGISTRATION NUMBER: 39,458 

(C) REFERENCE/DOCKET NUMBER: UMIC : 



113 /MOO 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (512) 418-3000 

(B) TELEFAX: (713) 789-2679 

(C) TELEX: 79-0924 
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(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5499 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

5 (ii) MOLECUIiE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA" 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
10 (B) LOCATION: 1..5499 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

ATG GAG AGC ACC TCC CCG CGA GGT CTC CGG TGC CCA CAG CTC TGC AGC 48 
15 Met Glu Ser Thr Ser Pro Arg Gly Leu Arg Cys Pro Gin Leu Cys Ser 
15 10 15 

CAC TCT GGC GCC ATG AGA GCG CCG ACC ACC GCT CGC TGC TCC GGA TGC 96 
His Ser Gly Ala Met Arg Ala Pro Thr Thr Ala Arg Cys Ser Gly Cys 
20 20 25 30 

ATC CAA CGG GTG CGT TGG AGG GGC TTC CTG CCA CTT GTC CTG GCT GTC 144 
lie Gin Arg Val Arg Trp Arg Gly Phe Leu Pro Leu Val Leu Ala Val 
35 40 45 

J5 

U TTG ATG GGG ACA AGT CAT GCC CAA CGG GAT TCC ATA GGG AGA TAC GAA 192 

Leu Met Gly Thr Ser His Ala Gin Arg Asp Ser lie Gly Arg Tyr Glu 

||j 50 55 60 

^§0 CCA GCT AGC AGG GAT GCG AAT CGG TTG TGG CAC CCC GTG GGC AGC CAC 24 0 

rO Pro Ala Ser Arg Asp Ala Asn Arg Leu Trp His Pro Val Gly Ser His 

gi 65 70 75 80 

ft'! 

CCC GCA GCG GCT GCA GCC AAG GTG TAC AGT CTG TTC CGA GAG CCT GAC 288 
^§5 Pro Ala Ala Ala Ala Ala Lys Val Tyr Ser Leu Phe Arg Glu Pro Asp 

= ' ^ ^ 85 ■ — ^ ' 90 95 

ifl GCG CCG GTC CCC GGC TTG TCG CCC TCT GAG TGG AAC CAG CCG GCC CAG 336 

Ala Pro Val Pro Gly Leu Ser Pro Ser Glu Trp Asn Gin Pro Ala Gin 
H$0 100 105 110 

O GGG AAC CCG GGA TGG CTC GCA GAG GCC GAG GCC AGG AGG CCA CCT CGA 384 

Gly Asn Pro Gly Trp Leu Ala Glu Ala Glu Ala Arg Arg Pro Pro Arg 
115 120 125 
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ACC CAG CAG CTG CGT CGA GTC CAG CCA CCT GTC CAG ACT CGG AGA AGC 432 
Thr Gin Gin Leu Arg Arg Val Gin Pro Pro Val Gin Thr Arg Arg Ser 
130 135 140 

50 CAT CCC CGG GGC CAG CAG CAG ATA GCA GCC CGG GCT GCA CCT TCT GTC 480 

His Pro Arg Gly Gin Gin Gin He Ala Ala Arg Ala Ala Pro Ser Val 
145 150 155 160 

GCG CGC CTG GAA ACC CCT CAG CGA CCC GCG GCT GCA CGG CGA GGG CGG 528 
55 Ala Arg Leu Glu Thr Pro Gin Arg Pro Ala Ala Ala Arg Arg Gly Arg 

165 170 175 

CTC ACT GGG AGA AAT GTC TGC GGG GGA CAG TGC TGC CCA GGA TGG ACA 576 
Leu Thr Gly Arg Asn Val Cys Gly Gly Gin Cys Cys Pro Gly Trp Thr 
60 180 185 190 

ACA TCA AAC AGC ACC AAC CAC TGT ATC AAA CCT GTG TGT CAG CCT CCC 624 
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Thr Ser Asn Ser Thr Asn His Cys lie Lys Pro Val Cys Gin Pro Pro 
195 200 205 

TGT CAG AAC CGA GGC TCC TGC AGC AGG CCC CAG GTC TGC ATC TGC CGT 672 
5 Cys Gin Asn Arg Gly Ser Cys Ser Arg Pro Gin Val Cys He Cys Arg 
210 215 220 

TCT GGC TTC CGT GGG GCG CGC TGT GAG GAG GTC ATC CCT GAG GAG GAA 720 
Ser Gly Phe Arg Gly Ala Arg Cys Glu Glu Val He Pro Glu Glu Glu 
10 225 230 235 240 

TTT GAC CCT CAG AAT GCC AGG CCT GTG CCC AGA CGC TCA GTG GAG AGA 768 
Phe Asp Pro Gin Asn Ala Arg Pro Val Pro Arg Arg Ser Val Glu Arg 
245 250 255 



15 



is 
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GCA CCC GGT CCT CAC AGA AGC AGT GAG GCC AGA GGA AGT CTA GTG ACC 816 
Ala Pro Gly Pro His Arg Ser Ser Glu Ala Arg Gly Ser Leu Val Thr 
260 265 270 



20 AGA ATA CAG CCG CTG GTA CCA CCA CCA TCA CCA CCT CCA TCT CGG CGC 864 

Arg He Gin Pro Leu Val Pro Pro Pro Ser Pro Pro Pro Ser Arg Arg 
275 280 285 

CTC AGC CAG CCC TGG CCC CTG CAG CAG CAC TCA GGG CCG TCC AGG ACA 912 
,25 Leu Ser Gin Pro Trp Pro Leu Gin Gin His Ser Gly Pro Ser Arg Thr 

IJ 290 295 300 

IJ1 GTT CGT CGG TAT CCG GCC ACT GGT GCC AAT GGC CAG CTG ATG TCC AAC 960 

Val Arg Arg Tyr Pro Ala Thr Gly Ala Asn Gly Gin Leu Met Ser Asn 
3D 305 310 315 320 

ru 

01 GCT TTG CCT TCA GGA CTC GAG CTG AGA GAC AGC AGC CCA CAG GCA GCA 1008 

fp Ala Leu Pro Ser Gly Leu Glu Leu Arg Asp Ser Ser Pro Gin Ala Ala 

325 330 335 



CAT GTG AAC TAT "CTC TCA CCGHCCC TGG GGG CTG AAC CTC ACC GAG AAA 1056 
His Val Asn His Leu Ser Pro Pro Trp Gly Leu Asn Leu Thr Glu Lys 
340 345 350 



'^40 ATC AAG AAA ATC AAA GTC GTC TTC ACC CCC ACC ATC TGC AAG CAG ACC 1104 

pd He Lys Lys He Lys Val Val Phe Thr Pro Thr He Cys Lys Gin Thr 

355 360 365 



TGT GCC CGG GGA CGC TGT GCC AAC AGC TGT GAG AAG GGT GAC ACC ACC 1152 
45 Cys Ala Arg Gly Arg Cys Ala Asn Ser Cys Glu Lys Gly Asp Thr Thr 

370 375 380 

ACC TTG TAG AGT CAG GGT GGC CAT GGG CAT GAC CCC AAG TCT GGC TTC 1200 
Thr Leu Tyr Ser Gin Gly Gly His Gly His Asp Pro Lys Ser Gly Phe 
50 385 390 395 400 

CGT ATC TAT TTC TGC CAA ATC CCC TGC CTG AAT GGT GGC CGC TGC ATC 1248 
Arg He Tyr Phe Cys Gin He Pro Cys Leu Asn Gly Gly Arg Cys He 
405 410 415 



GGC CGG GAC GAG TGC TGG TGT CCA GCC AAC TCC ACA GGA AAG TTC TGC 12 96 

Gly Arg Asp Glu Cys Trp Cys Pro Ala Asn Ser Thr Gly Lys Phe Cys 
420 425 430 



60 CAT CTG CCT GTC CCG CAG CCA GAC AGG GAA CCT GCA GGG CGA GGT TCC 1344 

His Leu Pro Val Pro Gin Pro Asp Arg Glu Pro Ala Gly Arg Gly Ser 
435 440 445 
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CGG CAC AGA ACC CTG CTG GAA GOT CCC CTG AAG CAA TCC ACC TTC ACG 1392 
Arg His Arg Thr Leu Leu Glu Gly Pro Leu Lys Gin Ser Thr Phe Thr 
450 455 460 

5 CTG CCT CTC TCT AAC CAG CTC GCC TCT GTG AAC CCC TCG CTG GTG AAG 1440 

^ Leu Pro Leu Ser Asn Gin Leu Ala Ser Val Asn Pro Ser Leu Val Lys 
465 470 475 480 

GTG CAA ATT CAT CAC CCG CCT GAG GCC TCT GTG CAG ATT CAC CAG GTG 1488 
10 Val Gin lie His His Pro Pro Glu Ala Ser Val Gin lie His Gin Val 

485 490 495 

GCC CGG GTC CGG GGT GAG CTG GAC CCC GTG CTG GAG GAC AAC AGT GTG 1536 
Ala Arg Val Arg Gly Glu Leu Asp Pro Val Leu Glu Asp Asn Ser Val 
15 500 505 510 

GAG ACC AGA GCC TCT CAT CGC CCC CAC GGC AAC CTA GGC CAC AGC CCC 1584 
Glu Thr Arg Ala Ser His Arg Pro His Gly Asn Leu Gly His Ser Pro 
515 520 525 



20 



m 



60 



TGG GCC AGC AAC AGC ATA CCC GCT CGG GCC GGA GAG GCC CCT CGG CCA 1632 
Trp Ala Ser Asn Ser He Pro Ala Arg Ala Gly Glu Ala Pro Arg Pro 
530 535 540 



^5 CCA CCA GTG CTG TCT AGG CAT TAT GGA CTT CTG GGC CAG TGT TAC CTG 1680 

y Pro Pro Val Leu Ser Arg His Tyr Gly Leu Leu Gly Gin Cys Tyr Leu 

^ 545 550 555 560 

V AGC ACG GTG AAT GGA CAG TGT GCT AAC CCC CTA GGT AGT CTG ACT TCT 1728 

^^0 Ser Thr Val Asn Gly Gin Cys Ala Asn Pro Leu Gly Ser Leu Thr Ser 
RJ 565 570 575 



CAG GAG GAC TGC TGT GGC AGT GTG GGG ACC TTC TGG GGG GTG ACC TCC 17 76 

Gin Glu Asp Cys Cys Gly Ser Val Gly Thr Phe Trp Gly Val Thr Ser 
II35 580 585 590 



n TGT GCT CCC TGC CCA CCC AGA CAA GAG GGT CCA GCC TTC CCA GTG ATT 1824 

Cys Ala Pro Cys Pro Pro Arg Gin Glu Gly Pro Ala Phe Pro Val He 
595 600 605 

Ho 

fU GAA AAT GGC CAG CTG GAG TGT CCC CAA GGA TAC AAG AGA CTG AAC CTC 1872 

Glu Asn Gly Gin Leu Glu Cys Pro Gin Gly Tyr Lys Arg Leu Asn Leu 
610 615 620 

t : 

45 AGC CAC TGC CAA GAT ATC AAT GAG TGC CTG ACC CTG GGC CTC TGC AAG 1920 

Ser His Cys Gin Asp He Asn Glu Cys Leu Thr Leu Gly Leu Cys Lys 
625 630 635 640 

GAC TCG GAG TGC GTG AAC ACC AGG GGC AGC TAC CTG TGC ACC TGC AGG 1968 
50 Asp Ser Glu Cys Val Asn Thr Arg Gly Ser Tyr Leu Cys Thr Cys Arg 

645 650 655 

CCT GGC CTC ATG CTG GAT CCG TCA AGG AGC CGC TGC GTA TCG GAC AAG 2016 
Pro Gly Leu Met Leu Asp Pro Ser Arg Ser Arg Cys Val Ser Asp Lys 
55 660 665 670 

GCT GTC TCC ATG CAG CAG GGA CTA TGC TAC CGG TCA CTG GGG TCT GGT 2064 
Ala Val Ser Met Gin Gin Gly Leu Cys Tyr Arg Ser Leu Gly Ser Gly 
675 680 685 



ACC TGC ACC CTG CCT TTG GTT CAT CGG ATC ACC AAG CAG ATA TGC TGC 2112 
Thr Cys Thr Leu Pro Leu Val His Arg He Thr Lys Gin He Cys Cys 
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10 



50 



690 695 700 

TGC AGC CGT GTG GGC AAA GCC TGG GGT AGC ACA TGT GAA CAG TGT CCC 2160 
Cys Ser Arg Val Gly Lys Ala Trp Gly Ser Thr Cys Glu Gin Cys Pro 
705 710 715 720 

CTG CCT GGC ACA GAA GCC TTC AGG GAG ATC TGC CCT GCT GGC CAT GGC 2208 
Leu Pro Gly Thr Glu Ala Phe Arg Glu He Cys Pro Ala Gly His Gly 
725 730 735 

TAC ACC TAC TCG AGC TCA GAC ATC CGC CTG TCT ATG AGG AAA GCC GAA 2256 
Tyr Thr Tyr Ser Ser Ser Asp He Arg Leu Ser Met Arg Lys Ala Glu 
-740 745 750 



15 GAA GAG GAA CTG GCT AGC CCC TTA AGG GAG CAG ACA GAG CAG AGC ACT 2304 

Glu Glu Glu Leu Ala Ser Pro Leu Arg Glu Gin Thr Glu Gin Ser Thr 
755 760 765 

GCA CCC CCA CCT GGG CAA GCA GAG AGG CAA CCA CTC CGG GCA GCC ACC 23 52 

20 Ala Pro Pro Pro Gly Gin Ala Glu Arg Gin Pro Leu Arg Ala Ala Thr 
770 775 780 

GCC ACC TGG ATT GAG GCT GAG ACC CTC CCT GAC AAA GGT GAC TCT CGG 2400 
Ala Thr Trp He Glu Ala Glu Thr Leu Pro Asp Lys Gly Asp Ser Arg 
JZ5 785 790 795 800 

d GCT GTT CAG ATC ACA ACC AGT GCT CCC CAC CTA CCT GCC CGG GTA CCA 2448 

III Ala Val Gin He Thr Thr Ser Ala Pro His Leu Pro Ala Arg Val Pro 

p 805 810 815 

30 

;i GGG GAT GCC ACT GGA AGA CCA GCA CCA TCC TTG CCT GGA CAG GGC ATT 2496 

0"^ Gly Asp Ala Thr Gly Arg Pro Ala Pro Ser Leu Pro Gly Gin Gly He 
m 820 825 830 



"35 CCA GAG Ai3T rCA^CA GAA GAC1.CAA GTG ATT CCC TCC AGT GAT GTC TTG 2544 

" Pro Glu Ser Pro Ala Glu Glu Gin Val He Pro Ser Ser Asp Val Leu 

G 835 840 845 



GTG ACA CAC AGC CCC CCA GAC TTT GAT CCA TGT TTT GCT GGA GCC TCC 2592 
1^0 Val Thr His Ser Pro Pro Asp Phe Asp Pro Cys Phe Ala Gly Ala Ser 
rif 850 855 860 

□ AAC ATC TGT GGC CCT GGG ACC TGT GTG AGC CTC CCA AAT GGA TAC AGA 2640 

Asn He Cys Gly Pro Gly Thr Cys Val Ser Leu Pro Asn Gly Tyr Arg 
45 865 870 875 880 

TGT GTC TGC AGC CCT GGC TAC CAG CTA CAC CCC AGC CAA GAC TAC TGT 2688 
Cys Val Cys Ser Pro Gly Tyr Gin Leu His Pro Ser Gin Asp Tyr Cys 
885 890 895 



ACT GAT GAC AAC GAG TGT ATG AGG AAC CCC TGT GAA GGA AGA GGG CGC 2736 
Thr Asp Asp Asn Glu Cys Met Arg Asn Pro Cys Glu Gly Arg Gly Arg 
900 905 910 



55 TGT GTC AAC AGT GTG GGC TCC TAC TCC TGC CTC TGC TAT CCT GGC TAC 2 784 

Cys Val Asn Ser Val Gly Ser Tyr Ser Cys Leu Cys Tyr Pro Gly Tyr 
915 920 925 

ACA CTA GTC ACC CTC GGA GAC ACA CAG GAG TGC CAA GAT ATC GAT GAG 2832 
60 Thr Leu Val Thr Leu Gly Asp Thr Gin Glu Cys Gin Asp He Asp Glu 
930 935 940 
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TGT GAG CAG CCC GGG GTG TGC AGT GGT GGG CGA TGC AGC AAC ACG GAG 2880 
Cys Glu Gin Pro Gly Val Cys Ser Gly Gly Arg Cys Ser Asn Thr Glu 
945 950 955 960 

5 GGC TCG TAG CAC TGC GAG TGT GAT CGG GGC TAG ATC ATG GTC AGG AAA 2928 

Gly Ser Tyr Kis Cys Glu Cys Asp Arg Gly Tyr lie Met Val Arg Lys 
965 970 975 

GGA CAC TGT CAA GAT ATC AAC GAA TGC CGT CAC CCT GGT ACC TGC CCT 2 976 

10 Gly His Cys Gin Asp lie Asn Glu Cys Arg His Pro Gly Thr Cys Pro 

980 985 990 

GAT GGG AGA TGC GTC AAC TCC CCT GGC TCC TAC ACT TGT CTG GCC TG^ 3024 
Asp Gly Arg Cys Val Asn Ser Pro Gly Ser Tyr Thr Cys Leu Ala Cys 
15 995 1000 1005 

GAG GAG GGC TAT GTA GGC CAG AGT GGG AGC TGT GTA GAT GTC AAT GAG 3072 
Glu Glu Gly Tyr Val Gly Gin Ser Gly Ser Cys Val Asp Val Asn Glu 
1010 1015 1020 

TGT CTG ACC CCT GGG ATA TGT ACC CAT GGA AGG TGC ATC AAC ATG GAA 3120 
Cys Leu Thr Pro Gly He Cys Thr His Gly Arg Cys He Asn Met Glu 
1025 1030 1035 1040 



20 



25 GGC TCC TTT AGA TGC TCC TGT GAG CCG GGC TAT GAG GTC ACC CCA GAC 3168 

^ Gly Ser Phe Arg Cys Ser Cys Glu Pro Gly Tyr Glu Val Thr Pro Asp 

Ul 1045 1050 1055 



m 



AAG AAG GGC TGC CGA GAT GTG GAC GAG TGT GCC AGC CGA GCC TCG TGC 3216 
b Lys Lys Gly Cys Arg Asp Val Asp Glu Cys Ala Ser Arg Ala Ser Cys 

J y y ^ ^^^^ ^ 1070 



01 
03 



.CAT, 

iD 



60 



CCC ACG GGC CTC TGC CTC AAC ACG GAG GGC TCC TTC ACC TGC TCA GCC 3264 
Pro Thr Gly Leu Cys Leu Asn Thr Glu Gly Ser Phe Thr Cys Ser Ala 
1075^ _ -JJ^SO 1085 

TGT CAG AGC GGG TAC TGG GTG AAC GAA GAT GGC ACT GCC TGT GAA GAC 3 312 

Cys Gin Ser Gly Tyr Trp Val Asn Glu Asp Gly Thr Ala Cys Glu Asp 
1090 1095 1100 



TTG GAT GAA TGT GCC TTC CCT GGA GTC TGC CCC ACA GGC GTC TGC ACC 3360 
_ Leu Asp Glu Cys Ala Phe Pro Gly Val Cys Pro Thr Gly Val Cys Thr 

g 1105 1110 1115 1120 

45 AAT ACT GTA GGC TCC TTC TCC TGC AAG GAC TGT GAC CAG GGC TAC CGG 3408 

Asn Thr Val Gly Ser Phe Ser Cys Lys Asp Cys Asp Gin Gly Tyr Arg 
1125 1130 1135 

CCC AAC CCC CTG GGC AAC AGA TGC GAA GAT GTG GAT GAG TGT GAA GGT 3456 
50 Pro Asn Pro Leu Gly Asn Arg Cys Glu Asp Val Asp Glu Cys Glu Gly 

1140 1145 1150 

CCC CAA AGC AGC TGC CGG GGA GGC GAA TGC AAG AAC ACA GAA GGT TCC 3 504 

Pro Gin Ser Ser Cys Arg Gly Gly Glu Cys Lys Asn Thr Glu Gly Ser 
55 1155 1160 1165 

TAC CAA TGC CTC TGT CAC CAG GGC TTC CAG CTG GTC AAT GGC ACC ATG 3552 
Tyr Gin Cys Leu Cys His Gin Gly Phe Gin Leu Val Asn Gly Thr Met 
1170 1175 1180 



TGT GAG GAC GTG AAT GAG TGT GTT GGG GAA GAG CAT TGT GCT CCT CAC 3600 
Cys Glu Asp Val Asn Glu Cys Val Gly Glu Glu His Cys Ala Pro His 
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1185 



1190 1195 1200 
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GGC GAG TGC CTC AAC AGC CTG GGC TCC TTC TTC TGC CTC TGT GCA CCC 3648 
Glv Glu Cys Leu Asn Ser Leu Gly Ser Phe Phe Cys Leu Cys Ala Pro 
1205 1210 1215 

GGC TTT GCT AGT GCT GAG GGG GGC ACC AGA TGC CAG GAT GTT GAT GAA 3696 
Gly Phe Ala Ser Ala Glu Gly Gly Thr Arg Cys Gin Asp Val Asp Glu 
1220 1225 1230 

TGT GCA GCC ACA GAC CCG TGT CCG GGA GGA CAC TGT GTC AAC ACA GAG 3744 
Cys Ala Ala Thr Asp Pro Cys Pro Gly Gly His Cys Val Asn Thr Glu 
1235 1240 1245 

15 GGC TCC TTC AGC TGT CTG TGT GAG ACT GCT TCC TTC CAG CCC TCC CCA 3 792 

Gly Ser Phe Ser Cys Leu Cys Glu Thr Ala Ser Phe Gin Pro Ser Pro 
1250 1255 1260 

GAC AGC GGA GAA TGT TTG GAT ATT GAT GAG TGT GAG GAC CGT GAA GAC 3840 
20 Asp Ser Gly Glu Cys Leu Asp He Asp Glu Cys Glu Asp Arg Glu Asp 
1265 1270 1275 1280 

CCG GTG TGC GGA GCC TGG AGG TGT GAG AAC AGT CCT GGT TCC TAC CGC 3888 
Pro Val Cys Gly Ala Trp Arg Cys Glu Asn Ser Pro Gly Ser Tyr Arg 
g5 1285 1290 1295 

TGC ATC CTG GAC TGC CAG CCT GGA TTC TAT GTG GCG CCA AAT GGA GAC 3936 
Cys He Leu Asp Cys Gin Pro Gly Phe Tyr Val Ala Pro Asn Gly Asp 
m 1300 1305 1310 

rSO 

TGC ATT GAC ATA GAT GAA TGT GCC AAT GAC ACT GTG TGT GGG AAC CAT 3984 

Cys He Asp He Asp Glu Cys Ala Asn Asp Thr Val Cys Gly Asn His 
03 1315 1320 1325 



m 



is GGC TTC TGT ^AC^C ACG GAC GGC TCC TTC CGC TGC CTG TGT GAC CAG 4032 

Gly Phe Cys Asp Asn Thr Asp Gly Ser Phe Arg Cys Leu Cys Asp Gin 
1330 1335 1340 

GGC TTC GAG ACC TCA CCA TCA GGC TGG GAG TGT GTT GAT GTG AAC GAG 408 0 

f40 Gly Phe Glu Thr Ser Pro Ser Gly Trp Glu Cys Val Asp Val Asn Glu 
1345 1350 1355 1360 

Q TGT GAG CTC ATG ATG GCA GTG TGT GGG GAT GCG CTC TGT GAG AAC GTG 4128 

Cys Glu Leu Met Met Ala Val Cys Gly Asp Ala Leu Cys Glu Asn Val 
45 1365 1370 1375 

GAA GGC TCC TTC CTG TGC CTT TGC GCC AGT GAC CTT GAG GAG TAC GAC 4176 
Glu Gly Ser Phe Leu Cys Leu Cys Ala Ser Asp Leu Glu Glu Tyr Asp 
1380 1385 1390 

GCA GAA GAA GGA CAC TGC CGT CCT CGG GTG GCT GGA GCT CAG AGA ATC 4224 
Ala Glu Glu Gly His Cys Arg Pro Arg Val Ala Gly Ala Gin Arg He 
1395 1400 1405 

55 CCA GAG GTC CGG ACA GAG GAC CAG GCT CCA AGC CTT ATC CGC ATG GAA 4272 

Pro Glu Val Arg Thr Glu Asp Gin Ala Pro Ser Leu He Arg Met Glu 
1410 1415 1420 

TGC TAC TCT GAA CAC AAT GGT GGT CCT CCC TGC TCT CAA ATC CTG GGC 432 0 

60 Cys Tyr Ser Glu His Asn Gly Gly Pro Pro Cys Ser Gin He Leu Gly 
1425 1430 1435 1440 



50 
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CAG AAC TCC ACA CAG GCC GAG TGC TGC TGC ACT CAG GGT GCC AGA TGG 4368 
Gin Asn Ser Thr Gin Ala Glu Cys Cys Cys Thr Gin Gly Ala Arg Trp 
1445 1450 1455 

5 GGA AAG GCC TGT GCG CCC TGC CCA TCT GAG GAC TCA GTT GAA TTC AGT 4416 

Gly Lys Ala Cys Ala Pro Cys Pro Ser Glu Asp Ser Val Glu Phe Ser 
1460 1465 1470 

CAG CTC TGC CCC AGT GGT CAA GGT TAG ATC CCA GTG GAA GGA GCC TGG 4464 
10 Gin Leu Cys Pro Ser Gly Gin Gly Tyr He Pro Val Glu Gly Ala Trp 
1475 1480 1485 

ACA TTT GGA CA^ ACC ATG TAT ACA GAT GCC GAT GAA TGT GTA CTG TTT 4512 
Thr Phe Gly Gin Thr Met Tyr Thr Asp Ala Asp Glu Cys Val Leu Phe 
15 1490 1495 1500 



20 



Ul 



60 



GGG CCT GCT CTC TGC CAG AAT GGC CGA TGC TCA AAC ATA GTG CCT GGC 4560 
Gly Pro Ala Leu Cys Gin Asn Gly Arg Cys Ser Asn He Val Pro Gly 
1505 1510 1515 1520 

TAC ATT TGC CTG TGC AAC CCT GGC TAC CAC TAT GAT GCC TCC AGC AGG 4608 
Tvr He Cys Leu Cys Asn Pro Gly Tyr His Tyr Asp Ala Ser Ser Arg 
1525 1530 1535 



f25 AAG TGC CAG GAT CAC AAC GAA TGC CAG GAC TTG GCC TGT GAG AAC GGT 4656 

^ Lys Cys Gin Asp His Asn Glu Cys Gin Asp Leu Ala Cys Glu Asn Gly 

1540 1545 1550 



GAG TGT GTG AAC CAA GAA GGC TCC TTC CAT TGC CTC TGC AAT CCC CCC 4704 
30 Glu Cys Val Asn Gin Glu Gly Ser Phe His Cys Leu Cys Asn Pro Pro 
1555 1560 1565 

ffi CTC ACC CTA GAC CTC AGT GGG CAG CGC TGT GTG AAC ACG ACC AGC AGC 4752 

Leu Thr Leu Asp Leu Ser Gly Gin Arg Cys Val Asn Thr Thr Ser Ser 
H5 1570 ^ _ 157_5_ ^ 1580 

Q ACG GAG GAC TTC CCT GAC CAT GAC ATC CAC ATG GAC ATC TGC TGG AAA 4800 

rf\ Thr Glu Asp Phe Pro Asp His Asp He His Met Asp He Cys Trp Lys 

r,' 1585 1590 1595 1600 



AAA GTC ACC AAT GAT GTG TGC AGC CAG CCC TTG CGT GGG CAC CAT ACC 4848 
Lys Val Thr Asn Asp Val Cys Ser Gin Pro Leu Arg Gly His His Thr 
1605 1610 1615 



45 ACC TAT ACA GAA TGC TGC TGC CAA GAT GGG GAG GCC TGG AGC CAG CAA 4896 

Thr Tyr Thr Glu Cys Cys Cys Gin Asp Gly Glu Ala Trp Ser Gin Gin 
1620 1625 1630 

TGC GCT CTG TGC CCG CCC AGG AGC TCT GAG GTC TAC GCT CAG CTG TGC 4 944 

50 Cys Ala Leu Cys Pro Pro Arg Ser Ser Glu Val Tyr Ala Gin Leu Cys 
1635 1640 1645 

AAC GTG GCT CGG ATT GAG GCA GAG CGC GGA GCA GGG ATC CAC TTC CGG 4992 
Asn Val Ala Arg He Glu Ala Glu Arg Gly Ala Gly He His Phe Arg 
55 1650 1655 1660 

CCA GGC TAT GAG TAT GGC CCT GGC CTG GAC GAT CTG CCT GAA AAC CTC 5040 
Pro Gly Tyr Glu Tyr Gly Pro Gly Leu Asp Asp Leu Pro Glu Asn Leu 
1665 1670 1675 1680 



TAC GGC CCA GAT GGG GCT CCC TTC TAT AAC TAC CTA GGC CCC GAG GAC 5088 
Tyr Gly Pro Asp Gly Ala Pro Phe Tyr Asn Tyr Leu Gly Pro Glu Asp 
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1685 



1690 1695 



10 



ACT GCC CCT GAG CCT CCC TTC TCC AAC CCA GCC AGC CAG CCG GGA GAC 5136 
Thr Ala Pro Glu Pro Pro Phe Ser Asn Pro Ala Ser Gin Pro Gly Asp 
1700 1705 1710 

AAC ACA CCT GTC CTT GAG CCT CCT CTG CAG CCC TCT GAA CTT CAG CCT 5184 
Asn Thr Pro Val Leu Glu Pro Pro Leu Gin Pro Ser Glu Leu Gin Pro 
1715 1720 1725 

CAC TAT CTA GCC AGC CAC TCA GAA CCC CCT GCC TCC TTC GAA GGC CTT 5232 
His Tyr Leu Ala Ser His Ser Glu Pro Pro Ala Ser Phe Glu Gly Leu 
1730 1735 1740 

15 CAG GCT GAG GAA TGT GGC ATC CTG AAT GGC TGT GAG AAT GGC CGC TGC 5280 

Gin Ala Glu Glu Cys Gly lie Leu Asn Gly Cys Glu Asn Gly Arg Cys 
1745 1750 1755 1760 

GTG CGT GTG CGG GAG GGC TAC ACT TGC GAC TGC TTT GAG GGC TTC CAG 5328 
20 Val Arg Val Arg Glu Gly Tyr Thr Cys Asp Cys Phe Glu Gly Phe Gin 

1765 1770 1775 

CTG GAT GCG CCC ACA TTG GCC TGT GTG GAT GTG AAC GAG TGT GAA GAC 5376 
Leu Asp Ala Pro Thr Leu Ala Cys Val Asp Val Asn Glu Cys Glu Asp 
^5 1780 1785 1790 

^ TTG AAC GGG CCT GCA CGA CTC TGT GCA CAC GGT CAC TGT GAG AAC ACA 5424 

Lfi Leu Asn Gly Pro Ala Arg Leu Cys Ala His Gly His Cys Glu Asn Thr 

~d 1795 1800 1805 

^30 

— GAG GGT TCC TAT CGC TGC CAC TGT TCG CCA GGT TAC GTG GCA GAG CCA 5472 

ffi Glu Gly Ser Tyr Arg Cys His Cys Ser Pro Gly Tyr Val Ala Glu Pro 
fQ 1810 1815 1820 

^^5 GGC CCC CCA CACJTGT GCG GCC AAG GAG 5499 



50 



Gly Pro Pro His Cys Ala Ala Lys Glu 
1825 1830 



LlO (2) INFORMATION FOR SEQ ID NO: 2: 

rs : 

: 

P (i) SEQUENCE CHARACTERISTICS: 

g (A) LENGTH: 1833 amino acids 

(B) TYPE: amino acid 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Glu Ser Thr Ser Pro Arg Gly Leu Arg Cys Pro Gin Leu Cys Ser 
15 10 15 

His Ser Gly Ala Met Arg Ala Pro Thr Thr Ala Arg Cys Ser Gly Cys 
55 20 25 30 

He Gin Arg Val Arg Trp Arg Gly Phe Leu Pro Leu Val Leu Ala Val 
35 40 45 

60 Leu Met Gly Thr Ser His Ala Gin Arg Asp Ser He Gly Arg Tyr Glu 
50 55 60 
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Pro Ala Ser Arg Asp Ala Asn Arg Leu Trp His Pro Val Gly Ser His 
65 70 75 80 

Pro Ala Ala Ala Ala Ala Lys Val Tyr Ser Leu Phe Arg Glu Pro Asp 
5 85 90 95 

Ala Pro Val Pro Gly Leu Ser Pro Ser Glu Trp Asn Gin Pro Ala Gin 
100 105 110 

10 Gly Asn Pro Gly Trp Leu Ala Glu Ala Glu Ala Arg Arg Pro Pro Arg 
115 120 125 



15 



45 



60 



Thr Gin Gin Leu Arg Arg Val Gin Pro Pro Val Gin Thr Arg Arg Ser 
130 135 140 

His Pro Arg Gly Gin Gin Gin He Ala Ala Arg Ala Ala Pro Ser Val 
145 150 155 160 



Ala Arg Leu Glu Thr Pro Gin Arg Pro Ala Ala Ala Arg Arg Gly Arg 
20 165 170 175 

Leu Thr Gly Arg Asn Val Cys Gly Gly Gin Cys Cys Pro Gly Trp Thr 
180 185 190 

25 Thr Ser Asn Ser Thr Asn His Cys He Lys Pro Val Cys Gin Pro Pro 
^ 195 200 205 

Lf! Cys Gin Asn Arg Gly Ser Cys Ser Arg Pro Gin Val Cys He Cys Arg 

,r\ 210 215 220 

m 

Ser Gly Phe Arg Gly Ala Arg Cys Glu Glu Val He Pro Glu Glu Glu 
P 225 230 235 240 

iQ 

Phe Asp Pro Gin Asn Ala Arg Pro Val Pro Arg Arg Ser Val Glu Arg 
y5 ^ , J245 _ _ _ 250 255 

O Ala Pro Gly Pro His Arg Ser Ser Glu Ala Arg Gly Ser Leu Val Thr 

m 260 265 270 

y - 

3d Arg He Gin Pro Leu Val Pro Pro Pro Ser Pro Pro Pro Ser Arg Arg 
rU 275 280 285 



Leu Ser Gin Pro Trp Pro Leu Gin Gin His Ser Gly Pro Ser Arg Thr 
290 295 300 

Val Arg Arg Tyr Pro Ala Thr Gly Ala Asn Gly Gin Leu Met Ser Asn 
305 310 315 320 



Ala Leu Pro Ser Gly Leu Glu Leu Arg Asp Ser Ser Pro Gin Ala Ala 
50 325 330 335 

His Val Asn His Leu Ser Pro Pro Trp Gly Leu Asn Leu Thr Glu Lys 
340 345 350 

55 He Lys Lys He Lys Val Val Phe Thr Pro Thr He Cys Lys Gin Thr 

355 360 365 



Cys Ala Arg Gly Arg Cys Ala Asn Ser Cys Glu Lys Gly Asp Thr Thr 
370 375 380 

Thr Leu Tyr Ser Gin Gly Gly His Gly His Asp Pro Lys Ser Gly Phe 
385 390 395 400 
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Arg He Tyr Phe Cys Gin He Pro Cys Leu Asn Gly Gly Arg Cys He 
405 ^■^^ 

Glv Arq ASP Glu Cys Trp Cys Pro Ala Asn Ser Thr Gly Lys Phe Cys 
420 425 430 

His Leu Pro Val Pro Gin Pro Asp Arg Glu Pro Ala Gly Arg Gly Ser 
435 440 445 

Arg His Arg Thr Leu Leu Glu Gly Pro Leu Lys Gin Ser Thr Phe Thr 
450 455 460 

Leu Pro Leu Ser Asn Gin Leu Ala Ser Val Asn Pro Ser Leu Val Lys 
465 470 475 480 

Val Gin He His His Pro Pro Glu Ala Ser Val Gin He His Gin Val 
485 490 495 

Ala Arg Val Arg Gly Glu Leu Asp Pro Val Leu Glu Asp Asn Ser Val 
500 505 510 

Glu Thr Arg Ala Ser His Arg Pro His Gly Asn Leu Gly His Ser Pro 
515 520 525 

Trp Ala Ser Asn Ser He Pro Ala Arg Ala Gly Glu Ala Pro Arg Pro 
530 535 540 

Pro Pro Val Leu Ser Arg His Tyr Gly Leu Leu Gly Gin Cys Tyr Leu 
545 550 555 560 

Ser Thr Val Asn Gly Gin Cys Ala Asn Pro Leu Gly Ser Leu Thr Ser 
565 570 575 

Gin Glu Asp Cys Cys Gly Ser Val Gly Thr Phe Trp Gly Val Thr Ser 
580_ _ _^ . 585 590 

Cys Ala Pro Cys Pro Pro Arg Gin Glu Gly Pro Ala Phe Pro Val He 
595 600 605 

Glu Asn Gly Gin Leu Glu Cys Pro Gin Gly Tyr Lys Arg Leu Asn Leu 
610 615 620 

Ser His Cys Gin Asp He Asn Glu Cys Leu Thr Leu Gly Leu Cys Lys 
625 630 635 640 

Asp Ser Glu Cys Val Asn Thr Arg Gly Ser Tyr Leu Cys Thr Cys Arg 
645 650 655 

Pro Gly Leu Met Leu Asp Pro Ser Arg Ser Arg Cys Val Ser Asp Lys 
660 665 670 

Ala Val Ser Met Gin Gin Gly Leu Cys Tyr Arg Ser Leu Gly Ser Gly 
675 680 685 

Thr Cys Thr Leu Pro Leu Val His Arg He Thr Lys Gin He Cys Cys 
690 695 700 

Cys Ser Arg Val Gly Lys Ala Trp Gly Ser Thr Cys Glu Gin Cys Pro 
705 710 715 720 

Leu Pro Gly Thr Glu Ala Phe Arg Glu He Cys Pro Ala Gly His Gly 
725 730 735 
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Tvr Thr Tvr Ser Ser Ser Asp He Arg Leu Ser Met Arg Lys Ala Glu 
740 745 750 

Glu Glu Glu Leu Ala Ser Pro Leu Arg Glu Gin Thr Glu Gin Ser Thr 
5 ^ 755 760 765 

Ala Pro Pro Pro Gly Gin Ala Glu Arg Gin Pro Leu Arg Ala Ala Thr 
770 775 780 

10 Ala Thr Trp He Glu Ala Glu Thr Leu Pro Asp Lys Gly Asp Ser Arg 
785 790 795 800 



15 



Ala Val Gin He Thr Thr Ser Ala Pro His Le ^ Pro Ala Arg Val Pro 
805 810 815 

Glv Asp Ala Thr Gly Arg Pro Ala Pro Ser Leu Pro Gly Gin Gly He 
820 825 830 



Pro Glu Ser Pro Ala Glu Glu Gin Val He Pro Ser Ser Asp Val Leu 
20 835 840 845 

Val Thr His Ser Pro Pro Asp Phe Asp Pro Cys Phe Ala Gly Ala Ser 
850 855 860 

Asn He Cys Gly Pro Gly Thr Cys Val Ser Leu Pro Asn Gly Tyr Arg 
% 865 870 875 880 

in Cvs Val Cys Ser Pro Gly Tyr Gin Leu His Pro Ser Gin Asp Tyr Cys 

885 890 895 

ir^BO 

Li Thr Asp Asp Asn Glu Cys Met Arg Asn Pro Cys Glu Gly Arg Gly Arg 
0= 900 905 910 

ffi 

in Cys Val Asn Ser Val Gly Ser Tyr Ser Cys Leu Cys Tyr Pro Gly Tyr 

f 35 9,15 , _ 920 925 

O Thr Leu Val Thr Leu Gly Asp Thr Gin Glu Cys Gin Asp He Asp Glu 

m 930 935 940 

C4O Cys Glu Gin Pro Gly Val Cys Ser Gly Gly Arg Cys Ser Asn Thr Glu 

rU 945 950 955 960 

o 

^5 Gly Ser Tyr His Cys Glu Cys Asp Arg Gly Tyr He Met Val Arg Lys 

^"-^ 965 970 975 

45 

Gly His Cys Gin Asp He Asn Glu Cys Arg His Pro Gly Thr Cys Pro 
980 985 990 

Asp Gly Arg Cys Val Asn Ser Pro Gly Ser Tyr Thr Cys Leu Ala Cys 
50 995 1000 1005 

Glu Glu Gly Tyr Val Gly Gin Ser Gly Ser Cys Val Asp Val Asn Glu 
1010 1015 1020 

55 Cys Leu Thr Pro Gly He Cys Thr His Gly Arg Cys He Asn Met Glu 

1025 1030 1035 1040 

Gly Ser Phe Arg Cys Ser Cys Glu Pro Gly Tyr Glu Val Thr Pro Asp 
1045 1050 1055 

Lys Lys Gly Cys Arg Asp Val Asp Glu Cys Ala Ser Arg Ala Ser Cys 
1060 1065 1070 
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Pro Thr Gly Leu Cys Leu Asn Thr Glu Gly Ser Phe Thr Cys Ser Ala 
1075 1080 1085 

Cys Gin Ser Gly Tyr Trp Val Asn Glu Asp Gly Thr Ala Cys Glu Asp 
1090 1095 1100 

Leu Asp Glu Cys Ala Phe Pro Gly Val Cys Pro Thr Gly Val Cys Thr 
1105 1110 1115 1120 

Asn Thr Val Gly Ser Phe Ser Cys Lys Asp Cys Asp Gin Gly Tyr Arg 
1125 1130 1135 

Pro Asn Fro ^ 3u Gly Asn Arg Cys Glu Asp Val Asp Glu Cys Glu Gly 
1140 1145 1150 

Pro Gin Ser Ser Cys Arg Gly Gly Glu Cys Lys Asn Thr Glu Gly Ser 
1155 1160 1165 

Tyr Gin Cys Leu Cys His Gin Gly Phe Gin Leu Val Asn Gly Thr Met 
1170 1175 1180 

Cys Glu Asp Val Asn Glu Cys Val Gly Glu Glu His Cys Ala Pro His 
1185 1190 1195 1200 

Gly Glu Cys Leu Asn Ser Leu Gly Ser Phe Phe Cys Leu Cys Ala Pro 
1205 1210 1215 

Gly Phe Ala Ser Ala Glu Gly Gly Thr Arg Cys Gin Asp Val Asp Glu 
1220 1225 1230 

Cys Ala Ala Thr Asp Pro Cys Pro Gly Gly His Cys Val Asn Thr Glu 
1235 1240 1245 

Gly Ser Phe Ser Cys Leu Cys Glu Thr Ala Ser Phe Gin Pro Ser Pro 
1250 , _ 1255_ ^ 1260 

Asp Ser Gly Glu Cys Leu Asp He Asp Glu Cys Glu Asp Arg Glu Asp 
1265 1270 1275 1280 

Pro Val Cys Gly Ala Trp Arg Cys Glu Asn Ser Pro Gly Ser Tyr Arg 
1285 1290 1295 

Cys He Leu Asp Cys Gin Pro Gly Phe Tyr Val Ala Pro Asn Gly Asp 
1300 1305 1310 

Cys He Asp He Asp Glu Cys Ala Asn Asp Thr Val Cys Gly Asn His 
1315 1320 1325 

Gly Phe Cys Asp Asn Thr Asp Gly Ser Phe Arg Cys Leu Cys Asp Gin 
1330 1335 1340 

Gly Phe Glu Thr Ser Pro Ser Gly Trp Glu Cys Val Asp Val Asn Glu 
1345 1350 1355 1360 

Cys Glu Leu Met Met Ala Val Cys Gly Asp Ala Leu Cys Glu Asn Val 
1365 1370 1375 

Glu Gly Ser Phe Leu Cys Leu Cys Ala Ser Asp Leu Glu Glu Tyr Asp 
1380 1385 1390 

Ala Glu Glu Gly His Cys Arg Pro Arg Val Ala Gly Ala Gin Arg He 
1395 1400 1405 
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Pro Glu Val Arg Thr Glu Asp Gin Ala Pro Ser Leu He Arg Met Glu 
1410 1415 1420 

Cys Tyr Ser Glu His Asn Gly Gly Pro Pro Cys Ser Gin lie Leu Gly 
X425 1430 1435 1440 

Gin Asn Ser Thr Gin Ala Glu Cys Cys Cys Thr Gin Gly Ala Arg Trp 
1445 1450 1455 

Gly Lys Ala Cys Ala Pro Cys Pro Ser Glu Asp Ser Val Glu Phe Ser 
1460 1465 1470 

Gin Leu Cys Pro Ser Gly Gin Gly Tyr He Pro Val Glu Gly Ala Trp 
:.475 1480 1485 

Thr Phe Gly Gin Thr Met Tyr Thr Asp Ala Asp Glu Cys Val Leu Phe 
1490 1495 1500 

Gly Pro Ala Leu Cys Gin Asn Gly Arg Cys Ser Asn He Val Pro Gly 
1505 1510 1515 1520 

Tyr He Cys Leu Cys Asn Pro Gly Tyr His Tyr Asp Ala Ser Ser Arg 
1525 1530 1535 

Lys Cys Gin Asp His Asn Glu Cys Gin Asp Leu Ala Cys Glu Asn Gly 
1540 1545 1550 

Glu Cys Val Asn Gin Glu Gly Ser Phe His Cys Leu Cys Asn Pro Pro 
1555 1560 1565 

Leu Thr Leu Asp Leu Ser Gly Gin Arg Cys Val Asn Thr Thr Ser Ser 
1570 1575 1580 

Thr Glu Asp Phe Pro Asp His Asp He His Met Asp He Cys Trp Lys 
1585 1590 1595 1600 

Lys Val Thr Asn Asp Val Cys Ser Gin Pro Leu Arg Gly His His Thr 
1605 1610 1615 

Thr Tyr Thr Glu Cys Cys Cys Gin Asp Gly Glu Ala Trp Ser Gin Gin 
1620 1625 1630 

Cys Ala Leu Cys Pro Pro Arg Ser Ser Glu Val Tyr Ala Gin Leu Cys 
1635 1640 1645 

Asn Val Ala Arg He Glu Ala Glu Arg Gly Ala Gly He His Phe Arg 
1650 1655 1660 

Pro Gly Tyr Glu Tyr Gly Pro Gly Leu Asp Asp Leu Pro Glu Asn Leu 
1665 1670 1675 1680 

Tyr Gly Pro Asp Gly Ala Pro Phe Tyr Asn Tyr Leu Gly Pro Glu Asp 
1685 1690 1695 

Thr Ala Pro Glu Pro Pro Phe Ser Asn Pro Ala Ser Gin Pro Gly Asp 
1700 1705 1710 

Asn Thr Pro Val Leu Glu Pro Pro Leu Gin Pro Ser Glu Leu Gin Pro 
1715 1720 1725 

His Tyr Leu Ala Ser His Ser Glu Pro Pro Ala Ser Phe Glu Gly Leu 
1730 1735 1740 
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Gin Ala Glu Glu Cys Gly He Leu Asn Gly Cys Glu Asn Gly Arg Cys 
1745 1750 1755 1760 

Val Arg Val Arg Glu Gly Tyr Thr Cys Asp Cys Phe Glu Gly Phe Gin 
5 1765 1770 1775 

Leu Asp Ala Pro Thr Leu Ala Cys Val Asp Val Asn Glu Cys Glu Asp 
1780 1785 1790 

10 Leu Asn Gly Pro Ala Arg Leu Cys Ala His Gly His Cys Glu Asn Thr 
1795 1800 1805 
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Glu Gly Ser Ty^ Arg Cys His Cys Ser Pro Gly Tyr Val Ala Glu Pre 
1810 1815 1820 

Gly Pro Pro His Cys Ala Ala Lys Glu 
1825 1830 



20 (2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3753 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
^ ^ (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(ix) FEATURE: 

(A) NAME/KEY: CDS 



m (B) LOCATION: 1..3753 

^'^5 (xi) SEQUENCE DESCRIPTION :. SEQ ID NO : 3 : 

□ ATG CGC CAG GCC GCA TTG GGG CTG CTG GCA CTA CTC CTG CTG GCG CTG 48 

m Met Arg Gin Ala Ala Leu Gly Leu Leu Ala Leu Leu Leu Leu Ala Leu 
15 10 15 

fto 

rU CTG GGC CCC GGC GGC CGA GGG GTG GGC CGG CCG GGC AGC GGG GCA CAG 96 

n Leu Gly Pro Gly Gly Arg Gly Val Gly Arg Pro Gly Ser Gly Ala Gin 
S; 20 25 30 

45 GCG GGG GCG GGG CGC TGG GCC CAA CGC TTC AAG GTG GTC TTT GCG CCT 144 

Ala Gly Ala Gly Arg Trp Ala Gin Arg Phe Lys Val Val Phe Ala Pro 
35 40 45 

GTG ATC TGC AAG CGG ACC TGT CTG AAG GGC CAG TGT CGG GAC AGC TGT 192 
50 Val He Cys Lys Arg Thr Cys Leu Lys Gly Gin Cys Arg Asp Ser Cys 
50 55 60 

CAG CAG GGC TCC AAC ATG ACG CTC ATC GGA GAG AAC GGC CAC AGC ACC 240 
Gin Gin Gly Ser Asn Met Thr Leu He Gly Glu Asn Gly His Ser Thr 
55 65 70 75 80 

GAC ACG CTC ACC GGT TCT GCC TTC CGC GTG GTG GTG TGC CCT CTA CCC 288 
Asp Thr Leu Thr Gly Ser Ala Phe Arg Val Val Val Cys Pro Leu Pro 
85 90 95 



TGC ATG AAC GGT GGC CAG TGC TCT TCC CGA AAC CAG TGC CTG TGT CCC 336 
Cys Met Asn Gly Gly Gin Cys Ser Ser Arg Asn Gin Cys Leu Cys Pro 
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100 105 110 

CCG GAT TTC ACG GGG CGC TTC TGC CAG GTG CCT GCT GCA GGA ACC GGA 384 
Pro Asp Phe Thr Gly Arg Phe Cys Gin Val Pro Ala Ala Gly Thr Gly 
115 120 125 

GCT GGC ACC GGG AGT TCA GGC CCC GGC TGG CCC GAC CGG GCC ATG TCC 432 
Ala Glv Thr Gly Ser Ser Gly Pro Gly Trp Pro Asp Arg Ala Met Ser 
130 135 140 

ACA GGC CCG CTG CCG CCC CTT GCC CCA GAA GGA GAG TCT GTG GCT AGC 480 
Thr Glv Pro Leu Pro Pro Leu Ala Pro Glu Gly Glu Ser Val Ala Ser 
145 150 155 160 

15 AAA CAC GCC ATT TAC GCG GTG CAG GTG ATC GCA GAT CCT CCC GGG CCG 528 

Lys His Ala lie Tyr Ala Val Gin Val He Ala Asp Pro Pro Gly Pro 
165 170 175 

GGG GAG GGT CCT CCT GCA CAA CAT GCA GCC TTC TTG GTG CCC CTG GGG 576 
20 Glv Glu Gly Pro Pro Ala Gin His Ala Ala Phe Leu Val Pro Leu Gly 

180 185 190 

CCA GGA CAA ATC TCG GCA GAA GTG CAG GCT CCG CCC CCC GTG GTG AAC 624 
Pro Gly Gin He Ser Ala Glu Val Gin Ala Pro Pro Pro Val Val Asn 
25 195 200 205 



m 

•est z 

■ H 



GTG CGT GTC CAT CAC CCT CCT GAA GCT TCC GTT CAG GTG CAC CGC ATC 672 
Val Arg Val His His Pro Pro Glu Ala Ser Val Gin Val His Arg He 
210 215 220 

#) 

rU GAG GGG CCG AAC GCT GAA GGC CCA GCC TCT TCC CAG CAC TTG CTG CCG 720 

fn Glu Gly Pro Asn Ala Glu Gly Pro Ala Ser Ser Gin His Leu Leu Pro 

225 230 235 240 



768 



% CAT CCC AAG CCC j:CG CAC CCG AGG CCA CCC ACT CAA AAG CCA CTG GGC 

5 His Pro Lys Pro Pro His Pro Arg Pro Pro Thr Gin Lys Pro Leu Gly 

245 250 255 

CGC TGC TTC CAG GAC ACA TTG CCC AAG CAG CCT TGT GGC AGC AAC CCT 816 
Mo Arg Cys Phe Gin Asp Thr Leu Pro Lys Gin Pro Cys Gly Ser Asn Pro 
rU 260 265 270 

2 TTG CCT GGC CTT ACC AAG CAG GAA GAT TGC TGC GGT AGC ATC GGT ACT 864 

y Leu Pro Gly Leu Thr Lys Gin Glu Asp Cys Cys Gly Ser He Gly Thr 

45 275 280 285 

GCC TGG GGA CAA AGC AAG TGT CAC AAG TGC CCA CAG CTT CAG TAT ACA 912 
Ala Trp Gly Gin Ser Lys Cys His Lys Cys Pro Gin Leu Gin Tyr Thr 
290 295 300 

GGG GTG CAG AAG CCT GTA CCT GTA CGT GGG GAG GTG GGT GCT GAC TGC 960 
Gly Val Gin Lys Pro Val Pro Val Arg Gly Glu Val Gly Ala Asp Cys 
305 310 315 320 

55 CCC CAG GGC TAC AAG AGG CTC AAC AGC ACC CAC TGC CAG GAT ATC AAC 1008 

Pro Gin Gly Tyr Lys Arg Leu Asn Ser Thr His Cys Gin Asp He Asn 
325 330 335 

GAA TGT GCG ATG CCC GGG AAT GTG TGC CAT GGT GAC TGC CTC AAC AAC 1056 
60 Glu Cys Ala Met Pro Gly Asn Val Cys His Gly Asp Cys Leu Asn Asn 

340 345 350 
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CCT GGC TCT TAT CGC TGT GTC TGC CCG CCC GGT CAT AGC TTG GOT CCC 1104 
Pro Gly Ser Tyr Arg Cys Val Cys Pro Pro Gly His Ser Leu Gly Pro 
355 360 365 

5 CTC GCA GCA GAG TGC ATT GCC GAC AAA CCA GAG GAG AAG AGC CTG TGT 1152 

Leu Ala Ala Gin Cys lie Ala Asp Lys Pro Glu Glu Lys Ser Leu Cys 
370 375 380 

TTC CGC CTT GTG AGC ACC GAA CAC CAG TGC CAG CAC CCT CTG ACC ACA 1200 
10 Phe Arg Leu Val Ser Thr Glu His Gin Cys Gin His Pro Leu Thr Thr 
385 390 395 400 

CGC CTA ACC CGC CAG CTC TGC TGC TGT AGT GTG GGT AAA GCC TGG GGT 1248 
Arg Leu Thr Arg Gin Leu Cys Cys Cys Ser Val Gly Lys Ala Trp Gly 
15 405 410 415 

GCC CGG TGC CAG CGC TGC CCG GCA GAT GGT ACA GCA GCC TTC AAG GAG 1296 
Ala Arg Cys Gin Arg Cys Pro Ala Asp Gly Thr Ala Ala Phe Lys Glu 
420 425 430 

20 

ATC TGC CCC GGC TGG GAA AGG GTA CCA TAT CCT CAC CTC CCA CCA GAC 1344 
He Cys Pro Gly Trp Glu Arg Val Pro Tyr Pro His Leu Pro Pro Asp 
435 440 445 

2i GCT CAC CAT CCA GGG GGA AAG CGA CTT CTC CCT CTT CCT GCA CCC GAC 13 92 

Ala His His Pro Gly Gly Lys Arg Leu Leu Pro Leu Pro Ala Pro Asp 
^ 450 455 460 

5 GGG CCA CCC AAA CCC CAG CAG CTT CCT GAA AGC CCC AGC CGA GCA CCA 1440 

ffi Glv Pro Pro Lys Pro Gin Gin Leu Pro Glu Ser Pro Ser Arg Ala Pro 
lU 465 470 475 480 

S CCC CTC GAG GAC ACA GAG GAA GAG AGA GGA GTG ACC ATG GAT CCA CCA 1488 
ft Pro Leu Glu Asp Thr Glu Glu Glu Arg Gly Val Thr Met Asp Pro Pro 
35 _485 490 495 

GTG AGT GAG GAG CGA TCG GTG CAG CAG AGC CAC CCC ACT ACC ACC ACC 1536 
Val Ser Glu Glu Arg Ser Val Gin Gin Ser His Pro Thr Thr Thr Thr 
I, 500 505 510 

TCA CCC CCC CGG CCT TAG CCA GAG CTC ATC TCT CGC CCC TCC CCA CCT 1584 
a Ser Pro Pro Arg Pro Tyr Pro Glu Leu He Ser Arg Pro Ser Pro Pro 



ffi 



515 520 525 

45 ACC TTC CAC CGG TTC CTG CCA GAC TTG CCC CCA TCC CGA AGT GCA GTG 1632 

Thr Phe His Arg Phe Leu Pro Asp Leu Pro Pro Ser Arg Ser Ala Val 
530 535 540 

GAG ATC GCC CCC ACT CAG GTC ACA GAG ACC GAT GAG TGC CGA TTG AAC 1680 
50 Glu He Ala Pro Thr Gin Val Thr Glu Thr Asp Glu Cys Arg Leu Asn 
545 550 555 560 

CAG AAT ATC TGT GGC CAT GGA CAG TGT GTG CCT GGC CCC TCG GAT TAG 1728 
Gin Asn He Cys Gly His Gly Gin Cys Val Pro Gly Pro Ser Asp Tyr 
55 565 570 575 

TCC TGC CAC TGC AAC GCT GGC TAG CGG TCA CAC CCG CAG CAC CGC TAC 1776 
Ser Cys His Cys Asn Ala Gly Tyr Arg Ser His Pro Gin His Arg Tyr 
580 585 590 



60 



TGT GTT GAT GTG AAC GAG TGC GAG GCA GAG CCC TGC GGC CCC GGG AAA 1824 
Cys Val Asp Val Asn Glu Cys Glu Ala Glu Pro Cys Gly Pro Gly Lys 
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10 



50 



595 600 605 

GGC ATC TGT ATG AAC ACT GGT GGC TCC TAG AAT TGT CAC TGC AAC CGA 1872 
Gly He Cys Met Asn Thr Gly Gly Ser Tyr Asn Cys His Cys Asn Arg 
610 615 620 

GGC TAC CGC CTC CAC GTG GGT GCA GGG GGC CGC TCG TGC GTG GAC CTG 192 0 

Gly Tyr Arg Leu His Val Gly Ala Gly Gly Arg Ser Cys Val Asp Leu 
625 630 635 640 

AAC GAG TGC GCC AAG CCT CAC CTG TGT GGG GAC GGT GGC TTC TGC ATC 1968 
Asn Glu Cys Ala Lys Pro His Leu Cys Gly Asp Gly Gly Phe Cys He 
645 650 655 



15 AAC TTC CCT GGT CAC TAC AAA TGC AAC TGC TAT CCT GGC TAC CGG CTC 2016 

Asn Phe Pro Gly His Tyr Lys Cys Asn Cys Tyr Pro Gly Tyr Arg Leu 
660 665 670 

AAG GCC TCC CGA CCG CCC ATT TGC GAA GAC ATC GAC GAG TGT CGC GAC 2064 
20 Lys Ala Ser Arg Pro Pro He Cys Glu Asp He Asp Glu Cys Arg, Asp 
675 680 685 

CCT AGC ACC TGC CCT GAT GGC AAA TGT GAA AAC AAA CCT GGC AGC TTC 2112 
Pro Ser Thr Cys Pro Asp Gly Lys Cys Glu Asn Lys Pro Gly Ser Phe 
25 690 695 700 

£i AAG TGC ATC GCC TGC CAG CCT GGC TAC CGT AGC CAG GGG GGC GGG GCC 2160 

Tp Lys Cys He Ala Cys Gin Pro Gly Tyr Arg Ser Gin Gly Gly Gly Ala 

705 710 715 720 

^0 

rU TGT CGT GAT GTC AAC GAA TGC TCC GAA GGT ACC CCC TGC TCT CCT GGA 2208 

m Cys Arg Asp Val Asn Glu Cys Ser Glu Gly Thr Pro Cys Ser Pro Gly 

725 730 735 

^S%5 TGG TGT GAG AAAJTTT CCG GGT TCT TAC CGT TGC ACG TGT GCC CAG GGG 22 56 

s Trp Cys Glu Lys Leu Pro Gly Ser Tyr Arg Cys Thr Cys Ala Gin Gly 

Q 740 745 750 

ATA CGA ACC CGC ACA GGA CGC CTC AGT TGC ATA GAC GTG GAT GAC TGT 2 3 04 

HO He Arg Thr Arg Thr Gly Arg Leu Ser Cys He Asp Val Asp Asp Cys 

fU 755 760 765 

y GAG GCT GGG AAA GTG TGC CAA GAT GGC ATC TGC ACG AAC ACA CCA GGC 23 52 

Glu Ala Gly Lys Val Cys Gin Asp Gly He Cys Thr Asn Thr Pro Gly 
45 770 775 780 

TCT TTC CAG TGT CAG TGC CTC TCC GGC TAT CAT CTG TCA AGG GAT CGG 2400 
Ser Phe Gin Cys Gin Cys Leu Ser Gly Tyr His Leu Ser Arg Asp Arg 
785 790 795 800 



AGC CGC TGT GAG GAC ATT GAT GAA TGT GAC TTC CCT GCG GCC TGC ATC 2448 
Ser Arg Cys Glu Asp He Asp Glu Cys Asp Phe Pro Ala Ala Cys He 
805 810 815 



55 GGG GGT GAC TGC ATC AAT ACC AAT GGT TCC TAC AGA TGT CTC TGT CCC 2496 

Gly Gly Asp Cys He Asn Thr Asn Gly Ser Tyr Arg Cys Leu Cys Pro 
820 825 830 

CTG GGT CAT CGG TTG GTG GGC GGC AGG AAG TGC AAG AAA GAT ATA GAT 2 544 

60 Leu Gly His Arg Leu Val Gly Gly Arg Lys Cys Lys Lys Asp He Asp 
835 840 845 
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GAG TGC AGC GAG GAG CCA GGC CTG TGC CTG CCC CAT GCC TGC GAG AAC 2592 
Glu Cys Ser Gin Asp Pro Gly Leu Cys Leu Pro His Ala Cys Glu Asn 
850 855 860 

5 CTC CAG GGC TCC TAT GTC TGT GTC TGT GAT GAG GGT TTC ACA CTC ACC 2640 

Leu Gin Gly Ser Tyr Val Cys Val Cys Asp Glu Gly Phe Thr Leu Thr 
865 870 875 880 

CAG GAC CAG CAT GGG TGT GAG GAG GTG GAG CAG CCC CAC CAC AAG AAG 2688 
10 Gin Asp Gin His Gly Cys Glu Glu Val Glu Gin Pro His His Lys Lys 

885 890 895 

GAG TGC TAC CTT AAC TTC GAT GAC ACA GTG TTC TGT GAC AGC GTA TTG 2736 
Glu Cys Tyr Leu Asn Phe Asp Asp Thr Val Phe Cys Asp Ser Val Leu 
15 900 905 910 

GCT ACC AAT GTC ACT CAG CAG GAA TGC TGT TGC TCT CTG GGA GCT GGC 2 784 

Ala Thr Asn Val Thr Gin Gin Glu Cys Cys Cys Ser Leu Gly Ala Gly 
915 920 925 



20 



in 



60 



TGG GGA GAC CAC TGC GAA ATC TAT CCC TGT CCA GTC TAC AGC TCA GCC 2832 
Trp Gly Asp His Cys Glu He Tyr Pro Cys Pro Val Tyr Ser Ser Ala 
930 935 940 



25 GAA TTT CAC AGC CTG GTG CCT GAT GGG AAA AGG CTA CAC TCA GGA CAA 2880 

P Glu Phe His Ser Leu Val Pro Asp Gly Lys Arg Leu His Ser Gly Gin 



945 950 955 960 



CAA CAT TGT GAA CTA TGC ATT CCT GCC CAC CGT GAC ATC GAC GAA TGC 2928 
^0 Gin His Cys Glu Leu Cys He Pro Ala His Arg Asp He Asp Glu Cys 
ry 965 970 975 

?4 ATA TTG TTT GGG GCA GAG ATC TGC AAG GAG GGC AAG TGT GTG AAC TCG 2 976 

¥^ lie Leu Phe Gly Ala Glu He Cys Lys Glu Gly Lys Cys Val Asn Ser 
IQ5 980_ _ _ - 985 990 

b CAG CCC GGC TAC GAG TGC TAC TGC AAG CAG GGC TTC TAC TAC GAT GGC 3024 

S Gin Pro Gly Tyr Glu Cys Tyr Cys Lys Gin Gly Phe Tyr Tyr Asp Gly 
995 1000 1005 

Ho 

fy AAC CTG CTG GAG TGC GTG GAC GTG GAC GAG TGC TTG GAT GAG TCT AAC 3072 

P= Asn Leu Leu Glu Cys Val Asp Val Asp Glu Cys Leu Asp Glu Ser Asn 

S 1010 1015 1020 

I t 

45 TGC AGG AAC GGA GTG TGT GAG AAC ACG TGG CGG CTA CCG TGT GCC TGC 3120 

Cys Arg Asn Gly Val Cys Glu Asn Thr Trp Arg Leu Pro Cys Ala Cys 
1025 1030 1035 1040 

ACT CCG CCG GCA GAG TAC AGT CCC GCA CAG GCC CAG TGT CTG ATC CCG 3168 
50 Thr Pro Pro Ala Glu Tyr Ser Pro Ala Gin Ala Gin Cys Leu He Pro 

1045 1050 1055 

GAG AGA TGG AGC ACG CCC CAG AGA GAC GTG AAG TGT GCT GGG GCC AGC 3216 
Glu Arg Trp Ser Thr Pro Gin Arg Asp Val Lys Cys Ala Gly Ala Ser 
55 1060 1065 1070 

GAG GAG AGG ACG GCA TGT GTA TGG GGC CCC TGG GCG GGA CCT GCC CTC 3264 
Glu Glu Arg Thr Ala Cys Val Trp Gly Pro Trp Ala Gly Pro Ala Leu 
1075 1080 1085 



ACT TTT GAT GAC TGC TGC TGC CGC CAG CCG CGG CTG GGT ACC CAG TGC 3312 
Thr Phe Asp Asp Cys Cys Cys Arg Gin Pro Arg Leu Gly Thr Gin Cys 
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1090 



1095 1100 



10 



AGA CCG TGC CCG CCA CGT GGC ACC GGG TCC CAG TGC CCG ACT TCA CAG 3360 
Ara Pro Cys Pro Pro Arg Gly Thr Gly Ser Gin Cys Pro Thr Ser Gin 
^1105 1110 1115 1120 

AGT GAG AGC AAT TCT TTC TGG GAC ACA AGC CCC CTG CTA CTG GGG AAG 3408 
Ser Glu Ser Asn Ser Phe Trp Asp Thr Ser Pro Leu Leu Leu Gly Lys 
1125 1130 1135 

TCT CCG CGA GAC GAA GAC AGC TCA GAG GAG GAT TCA GAT GAG TGC CGT 3456 
Ser Pro Arg Asp Glu Asp Ser Ser Glu Glu Asp Ser Asp Glu Cys Arg 
1140 1145 1150 

15 TGT GTG AGC GGA CCG TGT GTG CCA CGG CCA GGC GGG GCG GTA TGC GAG 3 504 

Cys Val Ser Gly Pro Cys Val Pro Arg Pro Gly Gly Ala Val Cys Glu 
1155 1160 1165 

TGT CCT GGA GGC TTT CAG CTG GAC GCC TCC CGT GCC CGC TGC GTG GAC 3 552 

20 Cys Pro Gly Gly Phe Gin Leu Asp Ala Ser Arg Ala Arg Cys Val Asp 
1170 1175 1180 

ATT GAT GAG TGC CGA GAA CTG AAC CAG CGG GGA CTG CTG TGT AAG AGC 3600 
He Asp Glu Cys Arg Glu Leu Asn Gin Arg Gly Leu Leu Cys Lys Ser 
^^25 1185 1190 1195 1200 

^ GAG CGG TGC GTG AAC ACC AGT GGA TCC TTC CGC TGT GTC TGC AAA GCT 3648 

Glu Arg Cys Val Asn Thr Ser Gly Ser Phe Arg Cys Val Cys Lys Ala 
yj ^ 1205 1210 1215 

^30 

rii GGC TTC ACG CGC AGC CGC CCT CAC GGG CCT GCG TGC CTC AGC GCC GCC 3696 

m Glv Phe Thr Arq Ser Arg Pro His Gly Pro Ala Cys Leu Ser Ala Ala 

J ^ 1220 1225 1230 

^>5 GCT GAT GAT GCAJ3CC ATA GCC^C ACC TCA GTG ATC GAT CAT CGA GGG 3 744 

Ala Asp Asp Ala Ala He Ala His Thr Ser Val He Asp His Arg Gly 
1235 1240 1245 

0' TAT TTT CAC ^'^^■^ 

1^0 Tyr Phe His 
rU 1250 



45 



50 



60 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

55 Met Arg Gin Ala Ala Leu Gly Leu Leu Ala Leu Leu Leu Leu Ala Leu 
15 10 15 

Leu Gly Pro Gly Gly Arg Gly Val Gly Arg Pro Gly Ser Gly Ala Gin 
20 25 30 



Ala Gly Ala Gly Arg Trp Ala Gin Arg Phe Lys Val Val Phe Ala Pro 
35 40 45 
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Val He Cys Lys Arg Thr Cys Leu Lys Gly Gin Cys Arg Asp Ser Cys 
50 55 60 

Gin Gin Gly Ser Asn Met Thr Leu He Gly Glu Asn Gly His Ser Thr 

70 75 



65 

ASP Thr Leu Thr Gly Ser Ala Phe Arg Val Val Val Cys Pro Leu Pro 
85 90 

cys Met Asn Gly Gly Gin Cys Ser Ser Arg Asn Gin Cys Leu Cys Pro 
100 105 110 

Pro AGP Phe Gly Arg Phe Cys Gin Val Pr^ Ala Ala Gly Thr Gly 
115 120 125 

Ala Gly Thr Gly Ser Ser Gly Pro Gly Trp Pro Asp Arg Ala Met Ser 
130 135 140 

Thr Gly Pro Leu Pro Pro Leu Ala Pro Glu Gly Glu Ser Val Ala Ser 
145 150 155 160 

Lys His Ala He Tyr Ala Val Gin Val He Ala Asp Pro Pro Gly Pro 
165 170 175 

Gly Glu Gly Pro Pro Ala Gin His Ala Ala Phe Leu Val Pro Leu Gly 
180 185 190 

Pro Gly Gin lie Ser Ala Glu Val Gin Ala Pro Pro Pro Val Val Asn 
195 200 205 

Val Arg Val His His Pro Pro Glu Ala Ser Val Gin Val His Arg He 
210 215 220 

Glu Gly Pro Asn Ala Glu Gly Pro Ala Ser Ser Gin His Leu Leu Pro 
225 _ , _ 230 235 240 

His Pro Lys Pro Pro His Pro Arg Pro Pro Thr Gin Lys Pro Leu Gly 
245 250 255 

Ara CVS Phe Gin Asp Thr Leu Pro Lys Gin Pro Cys Gly Ser Asn Pro 
260 265 270 

Leu Pro Gly Leu Thr Lys Gin Glu Asp Cys Cys Gly Ser He Gly Thr 
275 280 285 

Ala Trp Gly Gin Ser Lys Cys His Lys Cys Pro Gin Leu Gin Tyr Thr 
290 295 300 

Glv Val Gin Lys Pro Val Pro Val Arg Gly Glu Val Gly Ala Asp Cys 
305 310 315 320 

Pro Gin Gly Tyr Lys Arg Leu Asn Ser Thr His Cys Gin Asp He Asn 
325 330 335 

Glu Cys Ala Met Pro Gly Asn Val Cys His Gly Asp Cys Leu Asn Asn 
340 345 350 

Pro Gly Ser Tyr Arg Cys Val Cys Pro Pro Gly His Ser Leu Gly Pro 
355 360 365 

Leu Ala Ala Gin Cys He Ala Asp Lys Pro Glu Glu Lys Ser Leu Cys 
370 375 380 
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Phe Arg Leu Val Ser Thr Glu His Gin Cys Gin His Pro Leu Thr Thr 
385 390 395 400 

Arg Leu Thr Arg Gin Leu Cys Cys Cys Ser Val Gly Lys Ala Trp Gly 
405 410 415 

Ala Arg Cys Gin Arg Cys Pro Ala Asp Gly Thr Ala Ala Phe Lys Glu 
420 425 430 

He Cys Pro Gly Trp Glu Arg Val Pro Tyr Pro His Leu Pro Pro Asp 
435 440 445 

Ala His His Pro Gly Gly Lvs Arg Leu Leu Pro Leu Pro Ala Pro Asp 
450 455 460 

Gly Pro Pro Lys Pro Gin Gin Leu Pro Glu Ser Pro Ser Arg Ala Pro 
465 470 475 480 

Pro Leu Glu Asp Thr Glu Glu Glu Arg Gly Val Thr Met Asp Pro Pro 
485 490 495 

Val Ser Glu Glu Arg Ser Val Gin Gin Ser His Pro Thr Thr Thr Thr 
500 505 510 

Ser Pro Pro Arg Pro Tyr Pro Glu Leu He Ser Arg Pro Ser Pro Pro 
515 520 525 

Thr Phe His Arg Phe Leu Pro Asp Leu Pro Pro Ser Arg Ser Ala Val 
530 535 540 

Glu He Ala Pro Thr Gin Val Thr Glu Thr Asp Glu Cys Arg Leu Asn 
545 550 555 560 

Gin Asn He Cys Gly His Gly Gin Cys Val Pro Gly Pro Ser Asp Tyr 
565 570 575 

Ser Cys His Cys Asn Ala Gly Tyr Arg Ser His Pro Gin His Arg Tyr 
580 585 590 

Cys Val Asp Val Asn Glu Cys Glu Ala Glu Pro Cys Gly Pro Gly Lys 
595 600 605 

Gly He Cys Met Asn Thr Gly Gly Ser Tyr Asn Cys His Cys Asn Arg 
610 615 620 

Gly Tyr Arg Leu His Val Gly Ala Gly Gly Arg Ser Cys Val Asp Leu 
625 630 635 640 

Asn Glu Cys Ala Lys Pro His Leu Cys Gly Asp Gly Gly Phe Cys He 
645 650 655 

Asn Phe Pro Gly His Tyr Lys Cys Asn Cys Tyr Pro Gly Tyr Arg Leu 
660 665 670 

Lys Ala Ser Arg Pro Pro He Cys Glu Asp He Asp Glu Cys Arg Asp 
675 680 685 

Pro Ser Thr Cys Pro Asp Gly Lys Cys Glu Asn Lys Pro Gly Ser Phe 
690 695 700 

Lys Cys He Ala Cys Gin Pro Gly Tyr Arg Ser Gin Gly Gly Gly Ala 
705 710 715 720 
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Cys Arg Asp Val Asn Glu Cys Ser Glu Gly Thr Pro Cys Ser Pro Gly 
725 730 735 

Trp Cys Glu Lys Leu Pro Gly Ser Tyr Arg Cys Thr Cys Ala Gin Gly 
740 745 750 

He Arg Thr Arg Thr Gly Arg Leu Ser Cys He Asp Val Asp Asp Cys 
755 760 765 

Glu Ala Gly Lys Val Cys Gin Asp Gly He Cys Thr Asn Thr Pro Gly 
770 775 780 

Ser Phe Gin Cyr Gin Cys Leu Ser Gly Tyr His Leu Ser Arg Asp Arg 
785 790 795 800 

Ser Arg Cys Glu Asp He Asp Glu Cys Asp Phe Pro Ala Ala Cys He 
805 810 815 

Gly Gly Asp Cys He Asn Thr Asn Gly Ser Tyr Arg Cys Leu Cys Pro 
820 825 830 

Leu Gly His Arg Leu Val Gly Gly Arg Lys Cys Lys Lys Asp He Asp 
835 840 845 

Glu Cys Ser Gin Asp Pro Gly Leu Cys Leu Pro His Ala Cys Glu Asn 
850 855 860 

Leu Gin Gly Ser Tyr Val Cys Val Cys Asp Glu Gly Phe Thr Leu Thr 
865 870 875 880 

Gin Asp Gin His Gly Cys Glu Glu Val Glu Gin Pro His His Lys Lys 
885 890 895 

Glu Cys Tyr Leu Asn Phe Asp Asp Thr Val Phe Cys Asp Ser Val Leu 
900_ 905 910 

Ala Thr Asn Val Thr Gin Gin Glu Cys Cys Cys Ser Leu Gly Ala Gly 
915 920 925 

Trp Gly Asp His Cys Glu He Tyr Pro Cys Pro Val Tyr Ser Ser Ala 
930 935 940 

Glu Phe His Ser Leu Val Pro Asp Gly Lys Arg Leu His Ser Gly Gin 
945 950 955 960 

Gin His Cys Glu Leu Cys He Pro Ala His Arg Asp He Asp Glu Cys 
965 970 975 

He Leu Phe Gly Ala Glu He Cys Lys Glu Gly Lys Cys Val Asn Ser 
980 985 990 

Gin Pro Gly Tyr Glu Cys Tyr Cys Lys Gin Gly Phe Tyr Tyr Asp Gly 
995 1000 1005 

Asn Leu Leu Glu Cys Val Asp Val Asp Glu Cys Leu Asp Glu Ser Asn 
1010 1015 1020 

Cys Arg Asn Gly Val Cys Glu Asn Thr Trp Arg Leu Pro Cys Ala Cys 
1025 1030 1035 1040 

Thr Pro Pro Ala Glu Tyr Ser Pro Ala Gin Ala Gin Cys Leu He Pro 
1045 1050 1055 
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Glu Arg Trp Ser Thr Pro Gin Arg Asp Val Lys Cys Ala Gly Ala Ser 
1060 1065 1070 

Glu Glu Arg Thr Ala Cys Val Trp Gly Pro Trp Ala Gly Pro Ala Leu 
5 1075 1080 1085 

Thr Phe Asp Asp Cys Cys Cys Arg Gin Pro Arg Leu Gly Thr Gin Cys 
1090 1095 1100 

10 Arg Pro Cys Pro Pro Arg Gly Thr Gly Ser Gin Cys Pro Thr Ser Gin 
1105 1110 1115 1120 



15 



£0 



HO 



50 



55 



Ser Glu Ser As- Ser Phe Trp Asp Thr Ser Pro Leu Leu Leu Gly Lys 
1125 1130 1135 

Ser Pro Arg Asp Glu Asp Ser Ser Glu Glu Asp Ser Asp Glu Cys Arg 
1140 1145 1150 



Cys Val Ser Gly Pro Cys Val Pro Arg Pro Gly Gly Ala Val Cys Glu 
20 1155 1160 1165 

Cys Pro Gly Gly Phe Gin Leu Asp Ala Ser Arg Ala Arg Cys Val Asp 
1170 1175 1180 

25 He Asp Glu Cys Arg Glu Leu Asn Gin Arg Gly Leu Leu Cys Lys Ser 

0 1185 1190 1195 1200 



Glu Arg Cys Val Asn Thr Ser Gly Ser Phe Arg Cys Val Cys Lys Ala 
1205 1210 1215 

'^0 

fli Gly Phe Thr Arg Ser Arg Pro His Gly Pro Ala Cys Leu Ser Ala Ala 

m 1220 1225 1230 



Ala Asp Asp Ala Ala He Ala His Thr Ser Val He Asp His Arg Gly 
1235^ _ _J;240 1245 

Tyr Phe His 
1250 



(2) INFORMATION FOR SEQ ID NO : 5 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 
45 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Gly Glu Ser Val Ala Ser Lys His Ala He Tyr Ala Val Cys 
15 10 

(2) INFORMATION FOR SEQ ID NO : 6 : 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

TACCGATGCT ACCGCAGCAA TCTT 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

ATGCCTAAAC TCTACCAGCA CG 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
" (aT description! 73esc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 

GAGTCACGTC ATCCATTCCA CA 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 

CGTCCAAGTT GTGTCTTAGC AG 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gly Pro Pro Gly Pro Gin Gly Ala Thr Gly Pro Leu Gly Pro Lys Gly 
10 1 5 10 15 

Gin Thr Gly Glu Pro Gly lie Ala Gly Phe Lys Gly Glu Gin Gly Pro 
20 25 30 

15 Lys Gly Glu Thr Gly Pro Ala Gly Pro Gin Gly Ala Pro Gly Pro Ala 

35 40 45 



20 



55 



60 



Gly Glu Glu Gly Lys 
50 



(2) INFORMATION FOR SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 

25 (A) LENGTH: 159 base pairs 

g (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

% (D) TOPOLOGY: linear 
y ; 

'^fO (ii) MOLECULE TYPE: other nucleic acid 

fU (A) DESCRIPTION: /desc = "DNA" 

^J. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

W5 GGCCCTCCCG GTCCTCAAGG TGCAACTGGT CCTCTGGGCC CCAAAGGTCA GACGGGTGAG 6 0 

CCCGGCATCG CTGGCTTCAA AGGTGAACAA GGCCCCAAGG GAGAGACTGG ACCTGCTGGG 12 0 

ff'' CCCCAGGGAG CCCCTGGCCC TGCTGGTGAA GAAGGAAAA 159 

m 

IX, (2) INFORMATION FOR SEQ ID NO: 12: 

O (i) SEQUENCE CHARACTERISTICS: 

45 (A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desC = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AAACGTCACA CGTGANACGT GAACGTTGCT TGCTGG 3 5 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs 
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(B) 
(C) 
(D) 



•TYPE: nucleic acid 
STRANDEDNESS : single 
iLOGY: linear 



(ii) MOLECULE YYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

TTACGTCCAC GTACACGTCT VcCAAGCAAG CA 
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